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Abstract 

We  study  pooled  (or  group)  testing  as  a  cost-effective  alternative  for  screening  do- 
nated blood  products  (sera)  for  HIV;  rather  than  test  each  sample  individually,  this  method 
combines  various  samples  into  a  pool,  and  then  tests  the  pool.  A  group  testing  policy  spec- 
ifies an  initial  pool  size,  and  based  on  the  HIV  test  result,  either  releases  all  samples  in  the 
pool  for  transfusion,  discards  all  samples  in  the  pool,  or  divides  the  pool  into  subpools  for 
further  testing.  We  develop  a  generalized  linear  model  that  relates  the  HIV  test  output  to 
the  antibody  concentration  in  the  pool,  and  hence  captures  the  effect  of  pooling  together 
different  samples.  The  model  is  validated  and  simplified  using  data  from  a  variety  of  field 
studies,  and  is  embedded  into  a  dynamic  programming  algorithm  that  derives  a  group  testing 
policy  to  minimize  the  expected  cost  due  to  false  negatives,  false  positives  and  testing.  A 
simulation  study  shows  that  significant  cost  savings  can  be  achieved  without  compromising 
the  accuracy  of  the  test.  However,  the  efficacy  of  group  testing  depends  upon  the  use  of  a 
classification  rule  (that  is,  discard  the  samples  in  the  pool,  transfuse  them  or  test  them  fur- 
ther) that  is  dependent  on  pool  size,  a  characteristic  that  is  lacking  in  currently  implemented 
pooled  testing  procedures. 
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In  the  first  years  of  the  AIDS  epidemic,  numerous  instances  of  AIDS  infection  caused 
by  blood  transfusion  were  reported  to  the  Center  for  Disease  Control.  The  incidence  indi- 
cated that  the  blood  supply  is  a  virtually  frictionless  pathway  for  spreading  the  epidemic, 
and  the  extent  of  the  epidemic  dictated  that  screening  at  the  individual  level  should  be 
adopted.  As  a  consequence,  all  infected  blood  donors  would  be  identified  and  a  measurably 
safer  blood  supply  would  be  attained.  Nevertheless,  the  cost  for  such  a  screening  program 
is  substantial,  and  many  developing  countries,  particularly  in  Africa  where  the  epidemic  is 
spreading  rapidly,  are  struggling  to  fight  the  disease  on  limited  budgets. 

Pooled  testing  is  one  potential  way  to  reduce  the  monetary  cost  without  compromising 
the  accuracy  of  the  tests.  The  rationale  behind  pooled  testing  is  simple  and  intuitive:  suppose 
we  can  pool  the  sera  from  ten  (for  example)  individuals  and  test  the  pool  using  a  single  test. 
If  the  seroprevalence  of  HIV,  which  is  the  fraction  of  the  population  that  is  infected,  is  low 
enough,  then  there  is  a  high  probability  that  all  ten  individuals  in  the  pool  are  HIV  negative; 
in  this  case,  we  would  learn  from  a  single  test  what  otherwise  would  be  learned  from  ten 
individual  tests.  If,  on  the  other  hand,  the  test  outcome  is  positive,  then  additional  tests 
(either  pooled  or  individual)  would  need  to  be  carried  out. 

However,  pooled  testing  has  a  possible  shortcoming,  the  dilution  effect:  there  is  a 
serious  concern  that  if  the  pool  size  is  too  large,  then  any  HIV  positive  sera  will  be  sufficiently 
diluted  so  as  to  become  undetectable  by  the  test.  These  false  negatives  can  be  extremely 
costly,  particularly  when  pooled  testing  is  employed  to  protect  the  blood  supply.  Moreover, 
infected  individuals  exhibiting  an  unusually  low  level  of  antibody  concentration  are  less 
likely  to  be  detected  when  screened  in  pools.  Consequently,  the  sensitivity  of  the  test 
can  be  seriously  affected.  (Sensitivity  is  the  probability  of  detecting  a  diseased  individual, 
whereas  specificity  is  the  probability  of  detecting  a  healthy  individual.) 

Pooling  methods  have  been  evaluated  in  blood  banking  systems  in  several  developing 


countries,  including  Zaire,  Zimbabwe  and  Ecuador  (see,  for  example,  Cahoon- Young  et  al. 
1988,  Emmanuel  et  al.  1988,  Kline  et  al.  1989,  Behets  et  al.  1990  and  Ledro-Monroy  et 
al.  1990).  These  field  studies  suggest  that  pooling  methods  (with  group  sizes  as  large  as 
80)  may  be  as  sensitive  and  specific  as  individual  testing,  and  can  result  in  cost  savings 
from  5%  to  80%,  depending  on  the  actual  seropre valence.  On  the  other  hand,  the  World 
Health  Organization  (WHO),  concerned  with  the  dilution  effect  and  its  consequences  on  the 
sensitivity  of  the  test,  is  more  conservative  in  their  proposal:  They  recommend  the  use  of 
pools  of  size  no  greater  than  five  in  Tamashiro  et  al.  (1993).  The  discrepancy  between  the 
field  studies  and  the  recommendations  of  the  WHO  is  an  indication  that  the  dilution  effect  is 
not  well  understood,  and  the  bloodbanking  community  is  in  danger  of  either  underestimating 
or  overestimating  its  importance. 

In  his  seminal  paper,  Dorfman  (1943)  showed  how  pooled  testing,  which  is  called  group 
testing  in  the  statistical  literature,  can  be  employed  to  efficiently  eliminate  all  defective  items 
from  certain  large  populations.  The  method  found  an  immediate  application  in  screening 
World  War  II  draftees  for  syphilis,  where  it  resulted  in  considerable  savings.  The  group 
testing  problem  was  researched  aggressively  in  the  1950's  and  60's,  (see,  for  example,  Sobel 
and  Groll  1959),  and  a  large  literature  now  exists  on  this  topic;  readers  are  referred  to 
Johnson  et  al.  (1991)  for  a  survey,  and  to  Litvak  et  al.  (1992)  for  recent  work  that  is 
motivated  by  HIV  testing.  However,  nearly  all  existing  studies  concentrate  on  either  perfect 
tests  (i.e.,  tests  with  no  misclassification  errors)  or  imperfect  tests  with  errors  that  are 
independent  of  the  group  size;  two  exceptions  are  Hwang  (1976)  and  Burns  and  Mauro 
(1987),  who  assume  that  test  sensitivity  is  a  specified  function  of  the  group  size.  In  addition, 
all  studies  neglect  the  actual  test  mechanism  and,  except  for  Arnold  (1977),  assume  that  the 
test  outcome  is  binary  rather  than  continuous. 

In  contrast,  we  attempt  to  explicitly  model  both  the  dilution  effect  and  the  continuous 


nature  of  the  test  outcome.  Our  task  is  greatly  complicated  by  the  fact  that  the  HIV  test 
outcome,  which  is  a  continuous  quantity  called  the  optical  density  level,  is  only  an  indirect 
measurement  of  the  unobservable  antibody  concentration.  Starting  from  first  principles, 
we  derive  a  generalized  linear  model  that  relates  the  optical  density  level  to  the  antibody 
concentration  of  the  tested  sera.  The  model  explicitly  captures  the  physical  pooling  of  sera, 
and  is  validated  using  data  from  two  existing  dilution  studies.  A  simplified  version  of  the 
model  is  validated  using  data  from  an  existing  pooling  study. 

Traditional  group  testing  problems  consider  a  binary  test  outcome;  if  the  test  is  HIV 
negative,  then  the  pool  is  released  for  transfusion,  and  if  the  test  is  HIV  positive,  then  the 
pool  is  divided  into  subpools  for  further  testing.  If  the  pool  consists  of  a  single  sample,  then 
the  sample  is  discarded  if  the  test  outcome  is  HIV  positive.  Hence,  traditional  group  testing 
policies  are  characterized  by  the  initial  pool  size  and  the  resulting  subpool  configuration. 
Because  we  explicitly  consider  the  continuous  nature  of  the  test  outcome,  our  group  testing 
policy  must  also  develop  a  classification  rule  that  is  based  on  the  test  outcome:  the  pool  is 
deemed  either  HIV  positive  (each  sample  in  the  pool  is  discarded)  or  HIV  negative  (each 
sample  in  the  pool  is  released  for  transfusion),  or  the  pool  is  divided  into  subpools  for  further 
testing.  Our  validated  pooling  model  is  embedded  into  a  dynamic  programming  framework 
that  derives  the  group  testing  policy  that  minimizes  the  expected  cost  due  to  testing,  false 
positives  and  false  negatives.  Our  proposed  policy  is  tested  on  a  Monte  Carlo  simulation 
model,  and  the  results  indicate  that  pooled  testing,  with  a  classification  rule  that  explicitly 
depends  on  the  pool  size,  can  achieve  significant  cost  savings  over  individual  testing. 

The  paper  is  organized  as  follows.  A  preliminary  description  of  EL  ISA,  the  biological 
assay  used  for  HIV  testing,  is  given  in  Section  1.  The  data  used  in  the  paper  are  described 
in  Section  2.  The  generalized  linear  model  is  developed  in  Section  3,  and  is  simplified  and 
validated  in  Section  4  using  the  data  described  in  Section  2.     A  dynamic  programming 


framework  for  the  group  testing  problem  is  developed  in  Section  5,  and  several  policies  are 
derived.  A  simulation  study  is  undertaken  in  Section  6,  and  concluding  remarks  appear  in 
Section  7. 

1.  Serological  Tests  for  AIDS 

The  human  body  reacts  to  microbial  agents,  like  viruses,  bacteria,  parasites,  etc.,  by 
producing  antibodies.  The  antibodies  recognize  particular  molecules  on  the  surface  of  the 
infectious  agent  and  bind  to  them.  Such  molecules  are  called  antigens  (anybody  generators). 
Various  immunological  tests  are  designed  to  detect  antibodies,  thereby  identifying  the  sero- 
logical status  of  the  individual. 

The  Human  Immunodefficiency  Virus  (HIV)  is  the  pathological  agent  of  the  Acquired 
Immune  Deficiency  Syndrome  (AIDS).  Enzyme  Linked  Immunosorbent  Assays  (ELISA)  for 
the  HIV  virus  detect  the  anti-HIV  antibodies  and  are  frequently  used  for  HIV  screening. 
This  section  contains  a  brief  nontechnical  description  of  ELISAs. 

A  common  configuration  of  ELISAs  for  HIV  is  the  indirect  assay  pictured  in  Figure  1 
(see  George  and  Schochetman  1985  for  more  details).  Antigens  to  HIV  are  attached  to  a  solid 
phase  support  (usually  wells).  The  patient's  serum  (or  plasma)  is  diluted  (at  a  dilution  fixed 
by  the  manufacturer),  added  to  the  solid  phase  support  and  incubated  for  a  time  period. 
By  the  end  of  the  incubation  period,  any  antibodies  to  HIV  that  are  present  in  the  sample 
are  attached  to  the  antigens  on  the  solid  phase  support.  The  well  is  then  washed  so  that 
all  unattached  material  is  removed,  and  the  attached  antibodies  become  detectable.  The 
attached  antibodies  (immunoglobulins)  are  detected  when  a  secondary  antibody,  labeled  by 
an  enzyme,  is  added.  When  a  substrate  is  finally  added,  an  enzymatic  reaction  takes  place 
producing  a  color  change  proportional  to  the  amount  of  human  HIV  antibodies  present.  The 
ELISA  test  outcome  is  the  optical  density  (OD)  level,  which  quantifies  this  color  change. 
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Figure  1:  A  schematic  representation  of  the  indirect  ELISA. 


Hence,  the  OD  reading  is  determined  by  two  factors:  the  concentration  of  the  antibodies 
and  their  ability  to  bind  on  the  antigens  on  the  solid  support  (antibody  affinity).  If  the  OD 
recorded  at  the  end  of  the  process  exceeds  the  critical  value,  or  cutoff,  recommended  by  the 
manufacturer,  then  the  patient  is  declared  HIV  positive;  otherwise  the  patient  is  declared 
HIV  negative. 

An  alternative  configuration  of  ELISAs  is  the  competitive  assay.  Although  the  same 
types  of  antigens  used  in  indirect  ELISAs  are  attached  to  the  solid  phase  support,  this 
method  differs  from  the  indirect  one  in  the  detection  mechanism.  Enzyme  labeled  HIV 
antibodies  compete  with  the  patient's  antibodies  for  binding  sites.  The  color  change  observed 
is  inversely  proportional  to  the  concentration  of  HIV  antibodies  in  the  serum.  If  the  recorded 
OD  level  exceeds  the  critical  value  set  by  the  manufacturer,  then  the  sample  is  declared 
negative,  otherwise  positive.  We  concentrate  on  indirect  assays  in  this  paper,  since  most  of 
the  commercially  available  antibody  detection  kits  are  based  on  the  indirect  configuration 
of  ELISAs.  Nevertheless,  the  study  of  competitive  assays  is  not  any  more  difficult  and  only 
minor  modifications,  stated  when  necessary,  are  required. 


ELISAs  axe  inexpensive,  easy  to  administer  and  very  accurate;  however,  they  have 
a  shortcoming  that  stems  from  the  test's  indirect  detection  of  HIV  via  the  presence  of 
antibodies.  The  patient's  time  of  infection  is  followed  by  a  window  period  during  which  the 
antibody  concentration  in  the  patient's  serum  is  virtually  undetectable.  This  period  usually 
extends  from  three  to  nine  months  and  results  in  false  negatives.  Assays  for  detecting  HIV 
antibodies  cannot  identify  such  individuals;  therefore,  whenever  individuals  are  referred  to 
as  positive  or  negative,  we  are  actually  alluding  to  the  presence  or  absence  of  HIV  antibodies. 

2.  Description  of  the  Data 

We  use  individual  testing  data,  dilution  series  data  and  pooled  testing  data  obtained 
from  three  independent  sources. 

Individual  Testing  Data.  OD  readings  for  4000  HIV  negative  and  3000  HIV  pos- 
itive individuals  screened  using  four  different  assays  were  provided  by  the  National  HIV 
Reference  Laboratory  of  Australia  (Dax  1993).  It  is  convenient  to  normalize  the  OD  read- 
ings according  to  the  equation  x  =  ^D~^Q,  so  that  they  fall  between  zero  and  one;  A0  and 
Am  are  the  minimum  and  maximum  OD  readings,  respectively,  recorded  by  the  assay.  The 
values  of  A0  and  Am  vary  by  assay,  and  were  chosen  based  on  an  analysis  of  the  data  and 
discussions  with  the  data  providers.  For  this  data,  we  set  A0  =  0  and  Am  =  20. 

The  empirical  distributions  for  the  normalized  OD  readings  for  assay  A  (an  indirect 
ELISA)  are  given  in  Figure  2(a).  We  observe  that  both  the  mean  and  variance  are  smaller 
for  HIV  negative  than  for  HIV  positive  individuals.  The  relatively  large  spread  in  the  HIV 
positive  distribution  is  to  be  expected,  since  an  individual's  antibody  concentration  tends 
to  systematically  vary  as  the  disease  progresses;  see  George  and  Schochetman  for  details. 
The  two  populations  are  well  separated,  and  therefore  a  critical  value  separating  the  OD 
outcomes  into  HIV  positive  and  HIV  negative  can  be  selected. 
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Figure  2:   (a)  Empirical  densities  for  the  reactivity  ratios  of  4000  HIV  negative  and  3000 
HIV  positive  individuals,  and  (b)  densities  for  the  corresponding  LOD  values. 


For  reasons  that  will  become  clear  in  Section  4,  we  also  consider  the  logit  transfor- 
mation of  the  normalized  OD  readings:  x  — »■  ln(yf^),  which  will  be  referred  to  as  the  LOD 
(logit  OD)  readings.  The  empirical  densities  of  the  LODs  for  the  two  populations  are  given 
in  Figure  2(b).  The  sample  mean  and  standard  deviation  are,  respectively,  /z_  =  —4.82  and 
<7_  =  0.42  for  the  HIV  negative  population,  and  fj,+  =  0.80  and  a+  =  1.08  for  the  HIV 
positive  population. 

Figure  3  displays  the  normal  quantile  plot  for  the  empirical  distributions  in  Fig- 
ure 2(b);  that  is,  the  LOD  readings  are  ranked  in  magnitude  and  are  plotted  against  the 
standard  normal  quantiles.  A  straight  line  indicates  normality  of  the  data  points.  The  quan- 
tile plot  of  the  LOD  readings  is  approximately  linear  for  both  populations.  Deviations  from 
normality  are  observed  in  the  tails  of  the  HIV  negative  population  and  the  right  tail  of  the 
HIV  positive  population.  Most  importantly,  the  normal  approximation  captures  the  left  tail 
of  the  HrV  positive  distribution,  which  contains  the  low  OD  readings  that  might  become  un- 
detectable under  pooled  testing.  On  the  other  hand,  the  normal  approximation  to  the  HIV 
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Figure  3:  NormaJ  quantiles  for  the  LOD  readings  of  (a)  HIV  negative  and  (b)  HIV  positive 
populations. 


negative  population  underestimates  the  proportion  of  negative  individuals  with  a  relatively 
high  OD  reading,  which  might  lead  to  an  underestimation  of  false  positives.  Nevertheless, 
the  false  negatives,  which  are  the  overriding  concern  in  pooled  testing,  will  not  be  affected. 

In  the  analytical  model  developed  in  Section  5,  we  assume  that  the  LOD  readings  for 
the  HIV  negative  and  positive  populations  are  normally  distributed  with  respective  means 
/i_  =  —4.82  and  fi+  =  0.80,  and  respective  standard  deviations  o_  —  0.42  and  a+  =  1.08. 

Dilution  Series  Data.  Dilution  series  data  were  obtained  from  the  Caribbean 
Epidemiology  Center  (Hull  1991  and  de  Gourville  1992)  and  the  National  HIV  Reference 
Laboratory  of  Australia  (Dax).  The  purpose  of  both  of  these  studies  was  to  investigate  the 
effect  of  dilution  on  the  ability  of  ELISAs  to  detect  reactive  sera. 

In  the  Caribbean  Epidemiology  Center  (CAREC)  study,  ten  positive  sera  were  diluted 
sequentially  in  a  fixed  negative  serum  to  produce  a  series  of  thirteen  four-fold  dilutions  in 
the  ratios  1  :  1, 1  :  4, 1  :  16,. . . ,  1  :  412.  A  1  :  n  ratio  means  that  £  of  the  pool  consists 
of  the  positive  sample.   Each  dilution  was  tested  by  two  indirect  ELISAs  according  to  the 


manufacturer's  instructions.  Since  the  data  from  both  assays  yielded  similar  results,  we  only 
report  the  results  from  one  of  them.  The  raw  data  consists  of  130  OD  readings,  one  for 
each  of  the  thirteen  dilution  levels  of  each  of  the  ten  positive  samples.  We  used  A0  =  0  and 
Am  =  15  to  normalize  the  OD  readings. 

The  National  HIV  Reference  Laboratory  of  Australia  (NRL)  study  sequentially  diluted 
ten  positive  sera  in  a  fixed  negative  serum  to  produce  a  series  of  11  two-fold  dilutions,  with 
ratios  1  :  1,1  :  2, 1  :  4, . . . ,  1  :  210.  These  dilutions  were  tested  on  ten  different  assays.  We 
analyzed  the  data  from  several  of  these  assays  and  obtained  very  similar  results,  and  hence 
will  only  report  on  the  results  from  one  assay.  The  OD  readings  were  normalized  using 
Aq  =  0  and  A2  =  2. 

Pooled  Testing  Data.  Cahoon- Young  et  al.  (1992),  which  will  be  referred  to 
hereafter  as  Cahoon- Young  et  al.,  tested  1280  specimens  individually  and  in  a  series  of 
nested  pools.  More  specifically,  the  individual  specimens  were  pooled  to  generate  128  pools 
of  size  10;  the  pools  of  size  10  were  then  combined  to  form  64  pools  of  size  20,  then  32  pools 
of  size  40  and  finally  16  pools  of  size  80.  Twelve  individuals  were  found  to  be  HIV  positive, 
and  no  more  than  one  positive  sample  was  found  in  any  of  the  pools  of  size  80.  The  OD 
readings  at  every  stage  of  this  nested  testing  procedure  were  recorded. 

Note  that  the  dilution  series  studies  by  CAREC  and  NRL  differ  from  Cahoon  Young  et 
al.'s  pooling  study  in  one  important  respect:  positive  sera  are  diluted  with  varying  amounts 
of  the  same  negative  sera  in  the  dilution  series  studies,  whereas  individual  sera  are  combined 
with  a  varying  number  of  different  individual's  sera  in  Cahoon- Young  et  al.'s  pooling  study. 
Hence,  although  the  two  dilution  series  can  be  used  to  assess  the  effect  of  dilution,  the 
Cahoon-Young  et  al.  study  exactly  mimics  the  pooling  that  would  take  place  under  group 
testing. 


3.  A  Probabilistic  Model  for  the  Dilution  Effect 

When  sera  are  screened  in  pools,  the  OD  reading  is  determined  by  the  concentration 
and  affinity  of  the  antibodies  in  the  pool.  The  unobservability  of  the  antibody  concentration 
and  affinity  makes  it  very  difficult  to  estimate  the  OD  level  of  a  pool.  In  this  section, 
we  develop  a  stochastic  model  that  predicts  the  OD  level  of  a  sample  as  a  function  of 
the  antibody  concentration  and  affinity.  We  then  specialize  the  model  to  the  setting  of 
the  CAREC  and  NRL  dilution  studies,  and  obtain  a  generalized  linear  model  (GLM)  that 
predicts  the  OD  level  of  a  pool  as  a  function  of  the  OD  level  of  the  HIV  positive  sample 
and  the  corresponding  dilution  level.  The  model  is  adapted  in  Section  4  to  consider  pools 
consisting  of  individual  samples,  as  in  the  Cahoon- Young  et  al.  study. 

The  dilution  series  data  of  CAREC  and  NRL  essentially  generate  dose-response  curves: 
the  dose  takes  the  form  of  a  fixed  positive  sample  diluted  to  various  levels,  and  the  response 
is  simply  the  corresponding  OD  reading.  Empirical  dose-response  curves  typically  exhibit 
sigmoid  or  hyperbolic  behavior,  and  polynomial,  general  curvilinear,  logit  or  probit  regres- 
sion models  have  been  proposed  to  fit  these  curves.  Before  our  model  is  introduced,  it  is 
worthwhile  reviewing  the  traditional  approach,  and  we  focus  on  the  logistic  regression  for 
concreteness.  Let  V}  denote  the  LOD  reading  (that  is,  the  logit  of  the  normalized  OD  read- 
ing, as  in  Figure  2(b))  of  a  particular  HIV  positive  sample  that  is  diluted  to  the  ratio  1  :  dJ , 
where  d  is  an  integer  (d  =  4  for  CAREC  and  d  =  2  for  NRL)  and  j  =  0, 1, . . . ,  n.  The  linear 
regression  model  hypothesizes  that 

Yj^a  +  Pj  +  ej,  (1) 

where  Cj  are  iid  normal  random  variables  with  zero  mean.  Although  this  model  typically 
generates  predicted  values  that  coincide  well  with  observed  values,  it  exhibits  considerable 
heteroscedasticity  (state-dependent  noise),  and  hence  one  of  the  model's  basic  assumptions 
is  violated  (see  Tijssen  1985,  Chapter  15). 
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Whereas  the  existing  literature  has  taken  a  purely  empirical  approach  to  fitting  dose- 
response  curves,  we  develop  a  probabilistic  model  that  is  based  upon  a  set  of  primitive 
assumptions  regarding  the  behavior  of  the  ELISA  test  and  the  pooled  sera.  Our  analysis 
leads  to  a  GLM  for  the  dose-response  curve:  recall  that  while  a  linear  regression  model 
postulates  that  the  expected  value  of  the  dependent  variable  is  a  linear  function  of  the 
independent  variable,  a  GLM  assumes  that  a  function  of  the  expected  value  of  the  dependent 
variable  is  a  linear  function  of  the  independent  variable.  Like  model  (1),  our  GLM  captures 
the  sigmoid  nature  of  the  dose-response  curve.  In  addition,  it  proposes  a  particular  variance 
function  that,  as  will  be  seen  in  the  next  section,  stabilizes  the  heterogeneous  noise  present 
in  the  CAREC  and  NRL  data  sets. 

Our  model  is  influenced  by  Fisher  (1922),  who  developed  a  probabilistic  model  to 
estimate  the  number  of  bacteria  in  a  sample  of  water  or  soil.  The  following  eight  assumptions 
are  used  to  derive  our  model.  We  conferred  with  several  specialists,  and  none  of  these 
assumptions  generated  any  disputation;  assumption  5  was  the  only  one  that  appeared  to 
stimulate  any  reflection. 

Al.     The  number  of  HIV  antigens,  n,  attached  to  a  well  satisfies  n  >  106. 

A2.      No  more  than  one  HIV  antibody  can  bind  to  any  antigen. 

A3.  The  probability  of  an  antibody  binding  to  a  specific  antigen  is  small,  independent  for 
each  antigen,  and  constant  for  antibodies  from  the  same  serum.  The  expected  number 
of  attached  antibodies  on  a  large  collection  of  antigens  is  significant. 

A4.      A  secondary  antibody  will  bind  to  all  attached  primary  antibodies. 

A5.  The  normalized  OD  reading  is  linearly  proportional  to  the  number  of  attached  anti- 
bodies. 


A6.  The  expected  number  of  attached  HIV  antibodies  is  linearly  proportional  to  the  anti- 
body concentration.  The  proportionality  constant  can  vary  among  different  individuals 
due  to  differences  in  their  binding  properties  (affinity). 

A7.  The  antibody  concentration  of  pooled  sera  is  the  weighted  average  of  the  individual 
antibody  concentrations. 

A8.      Measurement  errors  are  negligible. 

If  a  competitive  assay  is  employed,  then  assumption  A5  is  replaced  by 

A5c.      The  normalized  OD  reading  is  proportional  to  the  number  of  attached  secondary 
antibodies, 

and  the  following  assumption  is  introduced: 

A9.  The  affinity  of  primary  antibodies  is  higher  than  that  of  secondary  antibodies;  therefore, 
secondary  antibodies  will  bind  to  antigens  on  the  solid  support  if  and  only  if  there  are 
not  enough  primary  antibodies  to  saturate  the  binding  sites. 

In  this  case,  the  subsequent  model  derivation  follows  virtually  unchanged. 

Motivated  by  Fisher's  analysis,  we  consider  a  well  with  n  antigens  bound  on  it,  and 
introduce  a  partition  of  the  well  into  k  subwells  indexed  s  =  1 , . . . ,  k.  Assuming  that  the 
antigens  are  uniformly  bound  on  the  well,  we  let  m  =  £  denote  the  number  of  antigens  on 
every  subwell.  Suppose  that  serum  i  is  diluted  with  an  HIV  negative  serum  in  the  ratio  1  :  d? , 
and  then  added  to  the  well.  Since  neither  the  concentration  nor  the  affinity  of  antibodies 
is  observable,  and  since  their  net  effect  is  multiplicative  in  nature  by  A6,  we  will  hereafter 
refer  to  the  product  of  the  antibody  concentration  and  the  antibody  affinity  as  the  antibody 
concentration.  Let  pi0  denote  the  antibody  concentration  for  the  undiluted  serum  i,  pica  be 
the  antibody  concentration  of  the  HIV  negative  serum,  p^  be  the  antibody  concentration 
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for  the  diluted  serum,  and  pXJ  be  the  binding  probability  for  the  antibodies  in  the  serum; 
note  that  none  of  these  quantities  are  observable.  Also,  let  NlJS  represent  the  number  of 
antibodies  attached  to  the  antigens  on  subwell  s. 

Our  model  development  consists  of  three  main  steps:  First,  we  find  the  probability 
distribution  for  S^*  =  (JVyi  +  . .  ■  +  Nljk)/k,  which  is  the  average  number  of  attached  antibod- 
ies per  subwell,  in  terms  of  the  unknown  binding  probability  p,r  Then  we  use  assumption 
A6  to  relate  the  unknown  binding  probability  ptJ  to  the  unknown  antibody  concentration  in 
the  diluted  serum.  Finally,  we  use  assumption  A5  to  relate  the  average  number  of  attached 
antibodies  per  subwell  Sl}k  to  the  normalized  OD  reading.  Combining  these  three  steps 
yields  our  basic  model. 

By  our  comments  above,  /VtJi, . . . ,  NtJk  are  independent  binomial  random  variables 
with  size  parameter  m  and  success  probability  piy  By  A3  and  the  law  of  rare  events,  the 
binomial  random  variable  can  be  approximated  by  a  Poisson  random  variable: 

P(Nijs  =  k)*e-"^^,  (2) 

where  Pi]m  =  mpij.  We  can  choose  a  sufficiently  fine  partition  of  the  well  such  that  P(NlJS  > 
1)  =  o(pijm),  implying 

P(NlJS  =  0)    a    e-p— ,  (3) 

P(NlJS  =  l)    «    l-e_p"m    and  (4) 

P(Nijs>l)    a    o(Pl]m).  (5) 

By  the  Central  Limit  Theorem  and  (3)-(5), 

Sijkl™  N(\  -  e-p'J"-,le-p^(l  -  e-p>""))   as   k  -+  oo,  (6) 

k 

and  hence 

ln(£(l  -  Sijk))  =  -pijm.  (7) 
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Since  ]C*=i  ^ijs  is  a  binomial  random  variable,  the  distribution  of  Sl]k  is  well  approximated 
by  the  normal  distribution  even  for  relatively  small  values  of  k  (typically  k  >  15).  In  Section 
4,  the  parameters  of  our  resulting  model  are  estimated  from  the  data,  and  k  approximately 
equals  20. 

Assumption  A6  implies  that 

Pijrn  =  -jr,  (8) 

which  relates  the  binding  probability  to  the  antibody  concentration  in  the  diluted  serum. 

Let  XXj  denote  the  normalized  OD  reading  of  the  diluted  sample.  Then  A5  implies 
that  Xtj  =  'ySi-jk,  where  7  is  the  constant  of  proportionality.  The  maximum  normalized 
OD  reading,  Xi}  —  1,  is  attained  when  antibodies  are  bound  on  all  subwells.  In  this  case, 
Sijk  =  1  by  (3)- (5),  and  so  7  =  1  and 

Xij  =  Sijk-  (9) 

Combining  equations  (6)  and  (8)-(9)  and  taking  logarithms  gives  the  basic  stochastic  model 
relating  the  normalized  OD  reading  to  the  antibody  concentration: 

Xa-NiEiXiil^EiXdil-ElXv])),  (10) 

where 

ln(-ln(l-E[XtJ]))=ln(^).  (11) 

In  Section  4,  this  basic  model  will  be  adapted  to  the  Cahoon- Young  et  al.  pooling 
study.  Now  we  specialize  this  model  to  the  setting  of  the  CAREC  and  NRL  dilution  studies. 
By  A7,  the  antibody  concentration  in  the  diluted  serum  is 

pl0  +  (dJ  -  l)pioo 


P«j 


dj 

~      PlO(1+^Aoc)_  (12) 

dJ  p.o 
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Combining  equations  (11)  and  (12),  and  using  the  first  order  Taylor  series  approximation 
ln(l  4-  x)  «  0  (the  adequacy  of  this  approximation  will  be  investigated  during  the  model 
validation  phase)  gives  the  GLM 

ln(- ln(l -£[*„]))=  In  (^)-j  In d.  (13) 

The  random  component  of  the  model  is  the  normalized  OD  level  XXj  of  the  diluted  sample, 
which  is  normally  distributed  by  (10).  The  systematic  component  is  the  dilution  level  j. 
The  link  between  the  random  component  and  the  systematic  component  is  of  the  form 

g(fMj)  =  ai  +  0j  (14) 

where  (Mj  =  E(Xij),  the  link  function  is  g  :  x  — ►  ln(—  ln(l  —  x)),  the  complementary  log-log 
(cloglog),  and  oii  =  ln(^)  and  j3  =  —  \nd  are  constants.  The  second  moment  of  the  random 
component  is  given  by  Var{XX])  =  4>Hi3(l  —  ^tJ),  which  will  be  denoted  by  V(f^ij),  where  the 
dispersion  parameter  <t>  equals  £. 

We  conclude  this  section  with  several  remarks  about  the  GLM  (13).  It  captures 
the  sigmoid  nature  of  the  dose-response  curve  via  the  cloglog  link  function.  Other  suitable 
sigmoid  link  functions  are  the  logit  and  probit.  The  logit  link  is  defined  by  g  :  x  — ►  ln(y^) 
and  the  probit  by  g  :  x  — ►  $_1(;c),  where  $  is  the  cumulative  standard  normal  distribution. 
To  obtain  the  best  fit  of  the  data,  GLMs  for  all  three  link  functions  will  be  considered  in 
Section  4.  Notice  that  if  we  replace  the  cloglog  function  by  the  logit  function  in  (11)  and 
set  the  dilution  level  j  —  0,  then  this  equation  implies  that  individual  LOD  readings  are 
normally  distributed,  which  is  consistent  with  the  assumption  that  was  discussed  in  Section 
2,  and  that  will  be  used  in  Section  5. 

The  y-axis  intercept  ai  corresponds  to  the  cloglog  of  the  normalized  OD  level  of  the 
positive  sample,  and  hence  provides  a  measure  of  the  antibody  concentration  of  the  original 
positive  sample;  our  model  predicts  that  ax  =  }n(pl0/k),  which  is  perfectly  consistent  with 
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this  interpretation.  The  slope  (3  is  the  marginal  change  in  response  due  to  a  change  in  the 
dilution  level  j;  hence,  0  =  —  In  d  is  also  consistent  with  this  interpretation.  Notice  that  pi0 
and  k  are  not  observable,  and  d  is  observable.  Hence,  the  slope  of  the  model  is  known,  but 
the  y— intercept  and  the  dispersion  parameter  are  unknown.  In  the  next  section,  we  fit  the 
GLM  to  the  CAREC  and  NRL  data  sets.  Unfortunately,  it  is  very  tedious  to  estimate  a, 
and  4>  for  a  fixed  slope  /3  =  —  \nd.  Therefore,  we  will  estimate  a,/5  and  <j>  from  the  model, 
and  see  if  the  predicted  slope  is  close  in  value  to  —  In  d. 

4.  Model  Validation 

In  this  section,  we  attempt  to  validate  the  GLM  developed  in  Section  3.  In  Subsection 
4.1,  the  parameters  of  the  model  are  estimated  using  the  dilution  series  studies  by  CAREC 
and  NRL.  The  GLM  is  adapted  to  the  pooling  setting  and  simplified  in  Subsection  4.2.  The 
simplified  pooling  model  is  validated  on  Cahoon-Young  et  al.'s  data  in  Subsection  4.3. 

4.1  Model  Fitting 

The  dilution  series  studies  undertaken  by  CAREC  and  NRL  generate  normalized  OD 
values  Xij  for  positive  sample  i  and  dilution  level  j.  This  data  will  be  used  to  estimate 
the  values  of  the  generalized  linear  model  parameters  a,/3  and  0.  If  the  random  mechanism 
embodied  in  the  GLM  is  the  true  process  by  which  the  data  are  generated,  then  the  maximum 
likelihood  estimators  can  be  obtained  by  iterative,  weighted  least  squares.  However,  the 
normality  assumption  on  X^  is  approximate,  and  we  can  relax  this  assumption  by  employing 
the  theory  of  quasilikelihood  functions  (see  pp.  323-352  of  McCullagh  and  Nelder  1989). 

This  theory  applies  under  the  following  four  conditions  that  are  satisfied  by  the  GLM: 
(i)  the  range  of  possible  normalized  OD  values  X,j  is  known,  (ii)  the  mean  normalized  OD 
level  is  specified  as  a  function  of  the  dilution  level  j,  (iii)  the  variance  of  the  normalized 
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OD  is  specified  as  a  function  of  the  mean  01),  and  (iv)  the  observations  are  statistically 
independent.  Let  us  fix  sample  i,  and  let  Xt  =  (A,0,  A,i, . . . ,  Xin)  denote  the  random 
vector  of  normalized  OD  readings,  and  assume  that  the  AtJ's  are  independent  with  mean 
/ijj  and  variance  V(/itJ).  Then  the  log-likelihood  function  for  /ztJ  can  be  replaced  by  the 
quasilikelihood  function 

where  xtJ  is  the  realization  of  A^.  The  maximum  likelihood  estimators  (MLE)  for  the  model 
parameters  are  then  obtained  by  maximizing  the  quasilikelihood  function 

QM=W"'x-twdv  <16) 

for  each  study,  where  n  —  12  for  CAREC  and  n  =  10  for  NRL.  We  use  the  glm  routine  of 
S-plus  (see  Hastie  and  Pregibon  1992)  to  obtain  MLEs  for  the  dispersion  parameter  <fi,  the 
slope  /5  and  the  y— intercept  a,,  i  =  1, . . . ,  10  for  each  study.  The  predicted  values  jlij  are 
the  mean  response  values  predicted  by  the  model,  and  are  given  by 

ln(r^-)=dl  +  /?J.  (17) 

The  Pearson  residuals  are  defined  as  ^.'C^y . 

Since  most  GLM  diagnostics  are  visual,  we  begin  by  analyzing  the  scatter  plot  of  the 
predicted  vs.  observed  values  and  the  Pearson  residual  plots.  Our  analysis  is  illustrated  using 
the  logit  link  function.  The  scatter  and  residual  plots  for  CAREC  and  NRL  are  given  in 
Figure  4.  No  significant  deviations  from  the  predicted  fit  (the  dotted  line)  are  observed  in  the 
scatter  plots.  The  observations  are  fairly  uniformly  spread  along  the  fitted  line  and  no  outliers 
are  detected.  It  is  worth  noting  that  the  traditional  linear  regression  model  (1)  was  also  fit 
to  the  data,  and  severe  heteroscedasticity  was  present;  hence,  the  variance  function  V{n) 
appears  to  stabilize  the  residuals,  giving  an  almost  uniform  spread  of  the  residuals  around 
zero.  Moreover,  as  illustrated  in  Figure  5,  the  three  link  functions  under  consideration  are 
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Figure  4:  Residual  and  scatter  plots  for  CAREC  (a,c)  and  NRL  (b,d). 

nearly  equivalent,  in  the  sense  that  the  resulting  models  predict  very  similar  values;  data  in 
Figure  5  are  from  the  NRL  study,  which  exhibited  less  agreement  among  the  link  functions 
than  the  CAREC  study.  Hereafter,  our  analysis  will  employ  the  logit  link  function.  In 
summary,  the  GLM  provides  a  reasonably  good  fit  to  the  available  data,  particularly  for  the 
CAREC  data  set,  and  several  diagnostics  did  not  reveal  any  serious  model  inconsistencies. 
To  assess  the  predictive  power  of  the  model,  we  use  the  LOD  readings  at  dilution 
levels  j  =  1,2,6,7,11,12  from  CAREC  and  j  =  1,2,5,7  from  NRL  to  obtain  the  MLEs 
for  the  logit  model.  The  predicted  values  obtained  for  the  remaining  data  (i.e.,  dilution 
levels  j  =  3, 4, 5, 8, 9, 10  for  CAREC  and  j  =  3, 4, 6, 8, 9, 10  for  NRL)  are  plotted  against  the 
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Figure  5:  Scatter  plots  for  the  response  predicted  by  the  logit,  cloglog  and  probit  link 
functions. 

observed  values  in  Figure  6.  The  predictive  power  of  the  model  is  verified  by  observing  that 
nearly  all  the  observed  points  lie  within  the  95%  confidence  interval  predicted  by  the  model. 

The  visual  diagnostics  can  be  supplemented  by  quantitative  diagnostics  based  on  the 
residual  deviance,  which  is  twice  the  log-likelihood  ratio.  In  particular,  the  goodness-of-fit 
can  be  assessed  quantitatively  by  comparing  the  proposed  GLM  to  the  full  model  that  has  as 
many  parameters  as  observations.  The  residual  deviance  is  asymptotically  x2  with  degrees 
of  freedom  given  by  the  difference  between  the  number  of  observations  and  the  number 
of  model  parameters.  Table  1  shows  the  95%  confidence  intervals  for  /3,  the  MLE  for  the 
dispersion  parameter  0,  the  degrees  of  freedom  of  the  x2  statistic  and  the  residual  deviance. 
The  quality  of  the  best  fit  is  ascertained  by  the  significance  level  of  the  x2  statistic,  which 
is  greater  than  0.999  in  all  cases. 

Recall  that  equation  (13)  predicts  that  0=  —bid.  Since  the  95%  confidence  intervals 
in  Table  1  do  not  contain  -In 4  =  -1.386  for  CAREC  and  -In 2  =  -0.693  for  NRL,  we 
deduce  that  —  In  d  is  not  an  accurate  prediction  of  the  slope  that  best  fits  the  GLM.  However, 
by  deriving  an  upper  bound  on  the  deviance  of  the  fixed  slope  GLM,  we  can  show  that  even 
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Figure  6:   Verifying  the  predictive  power  of  the  GLM:  The  observed  values  lie  within  the 
95%  confidence  interval  predicted  by  the  model. 

for  0  =  —  \nd,  the  GLM  accurately  describes  the  data.  The  bound  is  obtained  by  replacing 
the  actual  MLEs  by  a  set  of  readily  available  suboptimal  estimates. 


CAREC 

NRL 

Model 

95%  CI  for  0 

<t> 

d.f. 

x2 

95%  CI  for  0 

<t> 

d.f. 

x2 

cloglog 

logit 

probit 

-0.6020  ±  0.065 
-0.8323  ±  0.078 
-0.4883  ±0.041 

0.065 
0.042 
0.042 

68 
68 
68 

3.2130 
2.6393 
2.6802 

-0.3081  ±  0.042 
-0.3436  ±  0.052 

-0.1880  ±0.032 

0.052 
0.056 
0.065 

88 
88 
88 

4.4103 
4.4758 
5.3035 

Table  1:  GLM  parameter  estimates  and  goodness  of  fit. 

Let  cti  and  $  be  the  MLEs  for  the  variable  slope  GLM  (14),  and  let  a*  be  the  MLEs 
for  the  GLM  that  has  the  slope  fixed  at  (3  =  —  lnd.  The  deviance  for  the  fixed  slope  GLM  is 
Qo  —  Q((i*,x),  where  <3o  is  the  maximum  quasilikelihood  for  the  full  model  and  //*  is  the  set 
of  mean  response  values  predicted  using  the  MLEs  a*  and  0  =  —  In  d.  Suboptimal  estimates 
for  the  MLEs  of  the  fixed  slope  model  can  be  derived  by  using  a  =  a  and  0  =  —  In  d.  Let  Jx 
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be  the  set  of  mean  response  values  predicted  using  the  suboptimal  estimates;  then  Q(p,x), 
which  is  the  quasilikelihood  for  the  suboptimal  estimates,  is  bounded  below  by  Q(p*,x). 
Therefore,  Qq  —  Q(p,x)  is  an  upper  bound  on  the  deviance  of  the  fixed  slope  GLM.  The 
upper  bound  for  the  logit  model  is  46.2412  for  CAREC  and  13.617  for  NRL.  Since  the 
corresponding  significance  levels  are  0.9996  and  1.0  respectively,  we  deduce  that  the  fixed 
slope  model  provides  a  reasonable  description  of  the  data. 

The  quasilikelihood  analysis  can  also  provide  useful  insights  into  the  mechanism  of 
ELISAs  and  the  reliability  of  the  test  outcomes.  Consider  the  coefficient  of  variation  of  the 
normalized  OD  level  Xy,  which  is 


-  .  ^M  (18) 

Equation  (18),  the  values  of  <{)  in  Table  1 ,  and  Figure  4  suggest  that  the  coefficient  of  variation 
is  quite  small  for  HIV  positive  samples  that  have  not  been  substantially  diluted,  whereas  it 
is  much  larger  (near  one)  for  HIV  negative  samples  or  highly  diluted  positive  samples.  This 
observation  will  be  instrumental  in  the  development  of  the  Monte  Carlo  simulation  model 
in  Subsection  6.2.  As  a  side  remark,  since  0  =  |,  the  value  of  k  is  at  least  15,  and  hence  the 
Central  Limit  Theorem  employed  in  (6)  is  a  reasonable  approximation. 

Two  variants  of  the  model  were  also  considered  in  a  failed  attempt  to  obtain  a  more 
accurate  fit.  Recall  that  (13)  was  derived  under  the  rather  crude  approximation  ln(l  +  x)  ~ 
0.  For  high  dilution  levels,  the  assumption  pi0  S>  ploodJ  (see  equation  (12))  underlying 
this  approximation  will  be  violated.  Employing  the  second  order  Taylor  approximation 
ln(l  +x)«i,  we  tested  the  refined  model 

ln(-ln(l -£[*„]))  =\n(^)-j\nd  +  d3^.  (19) 

v  '  k  pio 

Residual  plots  and  scatter  plots  that  are  not  displayed  here  indicate  that  for  all  three  link 
functions,  this  refinement  has  very  little  effect  on  the  quality  of  the  model  fit.    We  also 
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tested  the  alternative  variance  function  V(//)  =  Ojjl2{\  —  fi)2  on  the  GLM  with  all  three  link 
functions.  The  deviance  for  the  variance  function  V(fj,)  =  <t>^{\  —  n)  is  significantly  smaller 
than  the  corresponding  deviance  for  V{n)  =  (pii2{\  —  /j)2,  indicating  that  the  original  variance 
function  provides  a  better  description  of  the  data. 

4.2.  A  Simplified  Pooling  Model 

The  complexity  of  the  traditional  (i.e.,  where  no  dilution  effect  is  captured)  group 
testing  problem,  combined  with  the  complexity  of  the  GLM,  leads  to  an  analytically  in- 
tractable model  for  analyzing  group  testing  policies  for  ELISAs.  Consequently,  we  propc 
two  simplifications  of  the  GLM  that  will  allow  for  a  tractable  analysis  in  Section  5.    This 
simplified  model  will  be  validated  on  the  Cahoon-Young  et  al.  data  in  Subsection  4.3. 

The  first  simplification  is  rather  bold:  Motivated  by  our  earlier  observation  that  the 
dispersion  estimates  <f>  in  Table  1  are  small,  we  propose  to  ignore  the  variability  in  the  GLM 
and  employ  a  deterministic  model  that  provides  a  one-to-one  mapping  between  normalized 
OD  readings  and  antibody  concentrations.  Although  this  assumption  compromises  the  accu- 
racy of  the  GLM  for  the  sake  of  tractability,  the  discussion  about  the  coefficient  of  variation 
Cu  below  equation  (18)  suggests  that  the  resulting  deterministic  model  should  be  reasonably 
reliable  for  the  dilution  of  HIV  positive  samples  at  practical  dilution  levels.  Our  assumption 
leads  to  the  simplified  dilution  model 

h(l^)-h(T)-"»*  (20) 

where  the  logit,  rather  than  the  cloglog,  function  is  being  employed. 

Recall  that  this  dilution  model  is  appropriate  for  the  CAREC  and  NRL  dilution  series, 
where  a  given  positive  serum  is  diluted  with  a  varying  amount  of  a  fixed  HIV  negative  serum, 
but  is  not  appropriate  for  Cahoon-Young  et  al.'s  data,  which  mirrors  an  actual  pooled  test. 
We  now  present  a  variant  of  our  model  that  is  appropriate  for  pooled  testing.   Let  a  pool 
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consist  of  n  samples  that  have  individual  normalized  OD  readings  Xu. . .  ,Xn  and  antibody 
concentrations  p\,...  ,pn-  Let  X  and  p  denote  the  normalized  OD  level  and  the  antibody 
concentration,  respectively,  of  the  pool.  The  only  difference  between  the  dilution  GLM  (13) 
and  the  pooling  GLM  is  that  equation  (12)  is  replaced  by  p  =  (p{  +  . . .  4-  pn)/n.  Repeating 
the  steps  leading  from  (11)  to  (13)  gives  the  pooling  GLM 

and  ignoring  the  variability  in  this  GLM  leads  to 

which  is  the  pooling  analog  to  the  simplified  dilution  model  (20). 

Our  second  simplification  is  to  replace  the  logarithm  of  the  average  antibody  concen- 
tration in  (22)  by  a  linear  approximation,  which  yields 

ln(l vs)  =  -On  p,  +  .  •  -  +  In pn)  -  In  k.  (23) 

v  1  —  Ap/        n 

Since  lnp*  =  ln(-j^r)  +ln  k  for  i  =  1, . . .  ,n  by  (22),  we  obtain  the  simplified  pooling  model 

The  concavity  of  the  logarithmic  function  implies  that  ln(pl+n+^n)  >  (lnpl+n'l''np");  hence, 
the  linearity  assumption  is  conservative  in  that  it  underestimates  the  OD  for  the  pooled 
sera,  and  provides  an  upper  bound  on  the  number  of  false  negatives  that  result  as  a  conse- 
quence of  pooled  testing.  Our  simplified  pooling  model  (24)  has  an  interesting  and  tractable 
characterization:  the  LOD  (logit  of  the  normalized  OD)  reading  for  a  pool  is  given  by  the 
average  of  the  individual  LOD  readings. 
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4.3.  Validation  of  the  Simplified  Pooling  Model 

The  simplified  pooling  model  (24)  will  be  validated  on  the  Cahoon- Young  et  al.  data 
in  this  subsection.  The  randomness  in  the  individual  LOD  readings,  when  embedded  into 
the  deterministic  pooling  model,  leads  to  a  random  walk  model  for  the  pooling  data.  We 
fit  the  random  walk  model  to  the  pooling  data,  and  use  a  nonparametric  approach  to  test 
whether  the  increments  of  the  random  walk  are  independent  and  have  zero  mean. 

Recall  that  Cahoon- Young  et  al.  individually  tested  1280  samples,  and  then  generated 
nested  pools  of  size  10,  20,  40  and  80.  The  total  sample  contained  12  HIV  positive  individuals, 
and  none  of  the  pools  contained  more  than  one  positive  sample;  hence,  12  of  the  16  pools  of 
size  80  contained  exactly  one  positive  sample. 

We  only  test  the  random  walk  model  on  the  pools  that  contain  one  positive  sample. 
In  fact,  the  Cahoon- Young  et  al.  data  is  incomplete:  Two  of  the  12  positive  samples  were 
not  tested  at  all  pool  sizes.  Hence,  we  will  restrict  ourselves  to  the  ten  samples  that  were 
tested  at  all  pool  sizes.  Let  i  =  1, . . . ,  10  index  the  ten  positive  samples.  Let  Yn  denote  the 
LOD  reading  for  positive  sample  i,  Yij  be  the  LOD  reading  of  the  jth  negative  sample  pooled 
with  positive  sample  i,  and  Y?  denote  the  LOD  reading  for  the  pool  of  size  10  x  2s  containing 
positive  sample  i;  this  pool  consists  of  samples  Yn, . . . ,  Vi,iox2',  where  s  =  0, 1,2, 3. 

Notice  that  for  fixed  i,  Y^  are  iid  random  variables  for  j  =  2, 3, . . . ,  80,  since  they  all 
correspond  to  LOD  readings  for  negative  sera;  the  assumption  of  normal  LOD  readings  is 
not  required  in  this  subsection.  Let  /i  =  E(Yl])  for  j  >  1.  For  i  =  1, . . . ,  10  and  s  =  1,  2,3, 
the  simplified  pooling  model  (24)  implies  that 

t^10x2"  y 

yr,  =  %v^  (25) 

i   v^10x2»-1  y  ^pl0x2"  y 

_       1  2^  =  1  Ii3  ^j=10x2'-1  +  l  i'J  (2gA 

2    10  x  2s"1  10  x  2s 

=    \yT,s-x  +  ^  (27) 
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where  e,5  =  — '="1x03x'2,'tl  is  the  noise  term.  Since  E(txs)  =  ^/i,  equation  (27)  describes  a 
three-step  random  walk  (Y&,  Y&,  Y%,  Y^)  that  starts  at  Yxq,  ends  at  Y%  and  has  drift  i/z.  The 
following  proposition  shows  that  the  autoregressive  process  (27)  can  be  transformed  into  an 
equivalent  driftless  random  walk  by  defining  Via  =  2s (Y?  —  fi). 

Proposition  1   For  i  =  1, . . . ,  10,  (V.o.  Ki,  K2i  K3)  is  a  three-step  driftless  random  walk. 

The  proof  can  be  found  in  the  Appendix.  The  simplified  pooling  model  (24)  can  be  validated 
by  establishing  that  the  driftless  random  walk  model  describes  the  Cahoon- Young  et  al. 
pooling  data.  This  validation  is  accomplished  by  verifying  that  the  random  increments  are 
independent  and  have  zero  mean.  Notice  that  the  random  walk  only  models  the  pooling 
effect  as  the  pool  size  changes  from  10  to  20,  20  to  40,  and  40  to  80;  it  does  not  capture  the 
pooling  effect  when  the  pool  size  changes  from  1  to  10. 

Hypothesis  I:  Independent  Random  Increments.  A  point  estimate  for  /x  is  re- 
quired to  pursue  a  statistical  analysis.  The  following  proposition  suggests  that  the  estimator 
should  be  chosen  to  minimize  the  variance. 

Proposition  2  Define  Vis(x)  =  2S(Y?S  —  x);  then  /z  =  arg  minxE(Vls(x)  —  Vi0(x))2. 

Since  the  true  variance  is  not  available,  we  consider  the  estimator  that  minimizes  the  sample 

variance.   An  additional  degree  of  freedom  can  be  introduced  by  considering  the  weighted 

sum  of  squares 

10    3 

ZE^(Vts(x)-Vl0(x))2.  (28) 

1=1  s=l 

The  weights  Wi,  u^,  W3  can  be  chosen  in  such  a  way  that  the  sample  variance  of  the  estimator 
is  minimized.  The  weighted  least  squares  estimator  ft  minimizing  (28)  is  given  by 


E£i  E5,iM2'(l-2«)YS-(l-  2')Y&}\ 

ioll.wi-23)2} 


~  _      ^t=i  i^a=\\.uJs\^  \>-  -  *  His  -  yi  -  ^  /ilQfi  ,2g. 


and  the  following  two  propositions  characterize  its  statistical  properties. 
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Proposition  3   The  estimator  p,  is  an  unbiased  estimator  of  [i  for  any  choice  of  weights  iut. 

Proposition  4    The  most  efficient  estimator  for  the  pooling  data  is  obtained  for  weights 
W\  =  w2  =  0,11/3  =  1- 

The  proofs  can  be  found  in  the  Appendix. 

Let  us  now  consider  the  process  Vis  —  2s (Y?  —  p.).  As  the  number  of  positive  samples 
in  the  data  tends  to  infinity,  the  Strong  Law  of  Large  Numbers  implies  that  fi  — >  /j  (and 
hence  Vis  — »  Vis)  with  probability  one.  Although  we  have  only  ten  positive  samples,  we  will 
carry  out  a  statistical  hypothesis  test  on  Vis  and  extrapolate  the  conclusions  to  Vls. 

The  independence  assumption  of  the  random  walk  will  be  tested  by  studying  the 
increments  AVis  =  Vis  —  K|3_!  for  s  =  1,2,3.  If  the  increments  are  independent,  then  with 
probability  |  they  are  either  above  or  below  the  median.  We  use  the  runs  above  or  below  the 
median  test,  which  is  a  nonparametric  procedure  that  is  described  in  detail  in  Chapter  4  of 
Madansky  (1988).  The  test  is  applied  as  follows.  Let  uls  take  the  value  1  if  AViS  is  above  the 
sample  median  and  0  otherwise.  A  run  is  a  maximal  consecutive  set  of  random  components 
ViS  having  the  same  value.  For  example,  if  the  sequence  is  1001001110111100110100,  then 
the  runs  are  separated  as  follows:  1|00|1|00|111|0|1111|00|11|0|1|00,  and  the  run  count  is 
12.  A  low  run  count  is  an  indication  that  observations  below  or  above  the  median  come 
together,  whereas  a  high  count  is  typical  for  median  reverting  behavior.  Note  that  under 
the  independence  hypothesis,  the  run  count  for  the  random  vector  (A1/tl,  AVl2,  AVi3)  is  k 
with  probability  pfc,  where  px  =  p3  =  j  and  p2  =  \-  Define  Tk  to  be  the  number  of  random 
vectors  in  our  data  set  with  run  count  k,  where  J23k=l  Tk  =  10.  Then 

P{TX,T2,T,)  =       *0!     .pTM2Pl*  (30) 

1  X-J-2-1  3' 

is  the  significance  of  the  observations  under  the  null  hypothesis. 

For  our  data,  we  calc.  .ted  T\  —  2,T2  =  5,T3  =  3;  the  significance  of  these  obser- 
vations is  P(Ti  =  2,  T2  =  5,r3  =  3)  =  0.077.    Although  a  p- value  for  this  test  cannot 
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be  obtained  without  ordering  the  state  space,  the  probability  of  observing  an  outcome  as 
extreme  as  this  under  the  independence  assumption  is  at  least  0.077;  hence,  we  cannot  reject 
the  independence  assumption  at  the  95%  significance  level.  In  fact,  the  outcome  (2,5,3)  is, 
along  with  outcomes  (3,5,2)  and  (2,6,2),  the  mode  of  the  distribution  of  (Ti,T2,T3). 

Hypothesis  II:  Zero  mean  random  increments.  The  95%  confidence  intervals 
for  the  mean  of  AVlU  AVl2  and  AVl3  are  given  by  0.2065  ±  0.2529,  0.08032  ±  0.4961  and 
—0.3491  ±  1.0637,  respectively.  The  zero  mean  hypothesis  cannot  be  rejected,  since  zero  is  a 
common  point  of  the  three  intervals.  In  conclusion,  the  data  support  the  hypothesis  that  the 
random  walk  (27)  provides  a  realistic  description,  thus  establishing  their  consistency  with 
the  simplified  pooling  model  (24). 

5.  The  Derivation  of  Group  Testing  Policies 

In  this  section,  we  embed  the  simplified  pooling  model  (24)  into  an  optimization 
framework  to  find  efficient  pooled  testing  policies.  Our  objective  is  to  minimize  the  expected 
weighted  cost  due  to  testing,  false  positives  and  false  negatives.  Suppose  a  pool  of  a  speci- 
fied size  is  tested  and  its  LOD  reading  is  determined.  The  decision  maker  has  three  options: 
stop  testing  and  classify  all  individuals  in  the  pool  as  HIV  negative  (and  transfuse  these 
samples),  stop  testing  and  classify  all  individuals  in  the  pool  as  HIV  positive  (and  discard 
these  samples)  or  divide  the  pool  into  subpools  for  further  testing.  There  are  many  possible 
ways  to  subdivide  the  pool  under  the  third  option,  and  we  consider  a  quite  general  class  of 
multistage  policies  employed  by  Arnold,  where  each  subpool  is  of  identical  size.  Our  proce- 
dure can  be  modified  slightly  to  allow  unequal  subpool  sizes,  as  in  the  sequential  procedure 
in  Hwang  (1984)  and  the  T£(V)  procedure  in  Litvak  et  al. 

For  a  given  initial  pool  size  and  subpool  configuration,  a  dynamic  programming  al- 
gorithm is  developed  in  Subsection  5.1  for  finding  the  optimal  policy  within  the  class  of 


multistage  policies  under  consideration.  Exhaustive  search  among  alternative  initial  pool 
sizes  and  subpool  configurations  is  required  to  find  the  cost  minimizing  policy.  Structural 
properties  of  the  optimal  policy  are  investigated  in  Subsection  5.2.  Since  the  dynamic  pro- 
gramming algorithm  is  computationally  intensive  and  the  resulting  policy  difficult  to  im- 
plement, a  procedure  for  deriving  near  optimal  Dorfman  policies  is  derived  in  Subsection 
5.3. 

5.1.  The  Dynamic  Programming  Formulation 

We  assume  that  the  blood  donor  population  is  composed  of  two  subpopulations:  HIV 
negative  (denoted  P_)  and  HIV  positive  (P+).  The  LOD  readings  of  P_  (P+,  respectively) 
are  assumed  to  be  iid  normal  random  variables  with  mean  /i_  (//+,  respectively)  and  standard 
deviation  cr_  (a+,  respectively).  This  assumption  is  consistent  with  the  GLM  and  provides 
a  reasonable  fit  to  the  data  in  Figure  2.  The  known  seroprevalence  ir  is  the  probability 
that  a  random  donor  is  HIV  positive.  Arnold's  notation  will  be  adopted  to  describe  the 
multistage  testing  procedure.  Consider  a  random  sample  of  n\  =  fljli  &}  individuals  from 
the  donor  population;  the  a/s  dictate  the  subpool  configuration.  Blood  sera  is  collected 
from  all  ri\  individuals  and  is  indexed  in  such  a  way  that  the  LOD  reading  for  every  sample 
is  denoted  by  {Y(ti,..„  ,In),  1  <  ii  <  Oi,.. . ,  1  <  ijv  <  Q/v}-  The  individuals  are  tested 
and  classified  according  to  the  following  multistage  screening  procedure  (see  Figure  7  for  a 
simple  example):  Start  by  obtaining  Y0,  the  LOD  reading  of  the  pool  composed  of  all  n\ 
individuals.  Based  on  Y0,  decide  whether  all  individuals  in  the  pool  can  be  classified  as  HIV 
negative  or  HIV  positive;  if  so,  stop  testing.  If  not,  then  subdivide  the  population  into  ax 
subpopulations  of  size  n^  =  11^=2  aj>  w^tn  the  first  subpopulation  consisting  of  all  individuals 
with  ii  =  1,  the  second  with  i\  =  2  and  so  on.  Obtain  the  LOD  readings  Vi ( 1 ) , . . . ,  Vi(ai) 
for  all  subpopulations.    For  each  subpopulation,  decide  whether  all  individuals  should  be 
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Y<1,1>    Y(l, 2)     Y(2,l)    Y(2,2) 


Yx  (1) 
Y(l,l)    Y(l,2) 


Y2  ( 


1 , 1) I  |Y2(1,2) 


7\ 


Yx  (2) 
Y(2,l)     Y(2,2) 


Figure  7:  A  simple  example  of  a  multistage  group  testing  procedure. 

classified  as  HIV  negative  or  positive  based  on  the  pair  (Y0,Yi(j));  if  so,  stop  testing.  If 
not,  subdivide  those  subpopulations  that  require  further  testing  into  a2  subpopulations  of 
size  713  =  YijL3aj-  Continue  in  this  vein  until  either  all  pools  are  deemed  HIV  negative  or 
positive,  or  stage  N,  where  individual  testing  is  used,  is  reached.  The  testing  scheme  can 
be  equivalently  described  by  a  rooted  tree,  where  the  nodes  of  the  tree  correspond  to  the 
different  subgroups  formed  during  the  procedure. 

According  to  the  simplified  dilution  model  (24),  the  LOD  readings  are  inductively 
defined  by  Y^{i\,-  ■  ■  ,In)  =  Y(i\,. . . ,  z'jv)  and 


Yj_i(ii,...  ,ij_i)  = 


a. 


(31) 


The  state  of  the  system  at  any  stage  of  the  screening  process  can  be  described  by  the 
LOD  readings  obtained  thus  far.  If  the  pool  that  is  currently  being  screened  is  com- 
posed of  all  individuals  with  the  first  j  indices  given  by  ix, . . .  ,ij,  then  the  state  is  given 
by  (Yo,Yi(ii),...  ,Yj(ii,...  ,ij)).  To  simplify  notation,  we  shorten  Yj(i\,...,ij)  to  Y3  for 
j  =  1, . . . ,  N,  and  denote  the  state  of  the  system  by  Sj  =  {Y0, . . . ,  Y3)  for  j  =  0, . . . ,  N.  If 
the  current  state  of  the  system  is  Sj  and  j  <  N  —  2,  then  three  decisions  are  possible:  Either 
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declare  all  individuals  in  the  pool  as  negative  and  stop  testing,  declare  all  individuals  in  the 
pool  as  positive  and  stop  testing,  or  subdivide  the  pool  into  aJ+\  subpools  of  size  nJ+2  and 
continue  testing.  Under  the  first  decision,  a  false  negative  cost  cp^  is  incurred  for  each  HIV 
positive  individual  in  the  pool,  and  under  the  second  decision,  a  false  positive  cost  cpp  is 
incurred  for  each  HIV  negative  individual  in  the  pool.  Under  the  third  decision,  a  testing 
cost  c(nJ+2)  is  incurred  for  each  of  the  a)+\  subpools.  By  defining  a^+i  =  1,  we  can  adopt 
the  same  notation  for  the  decision  at  stage  N  —  1.  At  stage  N  (individual  testing),  the 
individuals  are  classified  as  HIV  positive  or  negative,  and  the  false  positive  cost  cpp  or  the 
false  negative  cost  Cfn  is  incurred  for  any  individuals  that  are  misclassified. 

If  we  let  Jj(Sj)  denote  the  optimal  cost  for  stages  j  through  TV  when  in  state  S3  at 
stage  j,  then  the  dynamic  programming  algorithm  is  defined  inductively  by 

JN(SN)    =    min  {cFPP(YN  e  P-\SN),cFNP(YN  e  P+\SN)}  ,  (32) 

Jj(Sj)    =    Tmn{aj+i(c(nj+2)  +  E[Jj+l(Sj+i)\Sj]), 

aj+i  a/v 

cfn  £  •••  £  P(YN(iu...,iN)eP+\Si), 

cfp   E   ...f^P(YN(ill...,iN)eP-\Sj)}  (33) 

for  j  =  0, . . . ,  N  —  1.  Because  the  individual  LOD  readings  of  each  sample  in  a  pool  are  iid 
random  variables,  equation  (33)  can  be  simplified  to 

Jj(Sj)  =  Tmn{aj+1(c(nj+2)  +  E[Jj+1(Sj+1)\Sj]), 

nj+1cFNP(yjeP+\Sj),nj+1cFPP(YjeP..\Sj)}   for  j  =  0,...,N  -  1,         (34) 

where  V}  is  a  random  variable  denoting  the  individual  LOD  reading  of  a  generic  member  of 
the  pool  at  stage  j. 

Since  the  state  of  the  system  at  stage  j  is  given  by  the  LOD  readings  obtained  through 
stage  j,  the  dimensionality  of  the  state  space  grows  as  the  dynamic  programming  algorithm 
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proceeds;  hence,  the  algorithm  in  (32)  and  (34)  cannot  be  efficiently  used  for  numerical 
calculations.  The  following  proposition  shows  that  we  need  only  keep  track  of  the  most 
recent  LOD  reading. 

Proposition  5  The  state  of  the  system  at  every  stage  can  be  adequately  described  by  the 
latest  LOD  reading. 

We  prove  this  proposition  using  Corollary  2  of  Arnold,  which  is  stated  here  for  completeness. 
Corollary  2  (Arnold,  1977)  The  conditional  distribution  ofYJ+\  given  Sj  is  the  same  as 
the  conditional  distribution  ofYj+i  given  Yy 

The  following  lemma,  whose  proof  is  given  in  the  Appendix,  is  also  needed: 

Lemma  1  There  exists  a  version  of  the  conditional  probability  P(Yj  G  P+\S3)  that  depends 
on  S:  only  through  Y2;  the  same  is  true  for  P(Yj  G  P-\Sj). 

Proof  of  Proposition  5.  The  proposition  can  be  proved  by  induction  on  the  dynamic 
programming  algorithm.  The  proposition  is  true  for  j  =  N;  assume  that  it  is  true  for  j  =  k, 
so  that  Jk(Sk)  =  Jk(Yk)-  By  (34)  and  Lemma  1,  Jk-i  is  a  function  of  Jjt(V/t),  P(Yk  G  P+\Yk) 
and  P(Yk  G  P_|YJt);  hence,  the  proposition  follows  from  Corollary  2.  ■ 

Therefore,  we  can  replace  the  state  Sj  by  the  latest  LOD  reading  Yj,  and  the  optimal 
decision  rule  can  be  described  by  a  set  of  critical  regions  R*  and  Rj  for  j  =  0, . . . ,  N  such 
that  if  Yj  G  Rj  then  all  samples  in  the  pool  are  classified  as  HIV  negative  and  released  for 
transfusion,  if  Yj  G  Rj  then  all  samples  are  classified  as  HIV  positive  and  discarded,  and 
otherwise  additional  tests  are  carried  out.  The  critical  regions  are  defined  by 

flS  =  {>':^<-^-T},  (35) 

{  f+(Y)  CFp(\-TT)) 

I        f+\Y)       cFp{l-n)j 
fit  =  {Yj  :  cFPnJ+2P  (Yj  G  P.\Yj)  < 
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min  {cFNn]+2P  (Yj  €  P+\Y3)  ,c(nJ+2)  +  E  [Jj+1  (Yj+l)  \Yj]}}  ,  (37) 

R~  =  {Yj  :  cFNnJ+2P  fc  e  P+\YS)  < 

rmn{cFpnj+2P  (Y3  €  P.\Yj)  ,c(nJ+2)  +E[Jj+1  (Yj+1)  \Yj\}}  ,  (38) 

where  /_  and  /+  denote  the  normal  densities  for  the  HIV  negative  and  HIV  positive  pop- 
ulations, respectively.  Notice  that  the  critical  region  R^  maximizes  the  power  for  a  simple 
hypothesis  test.  Therefore,  by  the  Neyman-Pearson  lemma,  the  proposed  classification  pol- 
icy at  the  individual  testing  stage  not  only  minimizes  the  cost  for  the  particular  choices  of 
Cfn  and  cFp,  it  also  minimizes  the  type  II  error  (false  positive)  for  a  fixed  level  of  type  I 
error  (false  negative). 

5.2.  Structural  Properties  of  the  Optimal  Policy 

Intuitively,  one  might  expect  that  the  optimal  classification  policy  could  be  charac- 
terized by  a  set  of  constants  {cj,c+  :  0  <  j  <  N}  (where  cjj  =  c%)  such  that  Rj  — 
lYj  :  Yj  <  c~\  and  R*  =  lYj  :  Yj  >  cf\.  Such  a  classification  policy  for  a  generalized  group 
testing  procedure  will  be  called  a  cutoff  policy.  Arnold  obtained  sufficient  conditions  ensur- 
ing the  optimality  of  a  cutoff  policy  for  a  simpler  group  testing  problem  that  possesses  only 
two  possible  classifications.  Here,  we  extend  his  results  to  the  model  in  Subsection  5.1. 

The  following  monotonicity  notion  was  introduced  in  Arnold:  The  density  g3(yj)  of 
the  LOD  reading  Yj  has  the  Mon(j')  property  if  for  all  nonincreasing  functions  h(y),  the 
conditional  expectation  E{h{Yj)\YJ_\  =  s)  is  monotone  nonincreasing  in  s.  The  following 
proposition,  which  is  proved  in  the  Appendix,  provides  sufficient  conditions  for  the  optimality 
of  the  cutoff  policy. 

Proposition  6  A  cutoff  policy  is  optimal  if  the  likelihood  ratio  -r-  is  monotone  nondecreas- 
ing  and  the  density  gj(y)  ofYj  has  the  Mon(j)  property  for  all  j. 
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The  definition  of  Mon(j)  cannot  be  used  for  testing  whether  a  density  has  the  required 
property.  Instead,  the  following  proposition  can  be  employed  (see  the  Appendix  for  a  proof). 

Proposition  7   The  density  g}{y)  has  the  Mon(j)  property  if  for  all  j  and  all  y,  P{Y}  < 
y\Yj-i  =  s)  is  a  nonincreasing  function  of  s. 

It  turns  out  that  neither  of  the  sufficient  conditions  in  Proposition  6  are  satisfied  by 
our  data.  Recall  that  /_  is  the  normal  density  with  mean  /i_  =  —4.82  and  cr_  =  0.42, 
and  /+  is  normal  with  mean  /i+  =  0.8  and  a+  =  1.08.  Because  the  variation  in  P+  is 
larger  than  the  variation  in  P_,  *f-  is  not  monotonically  nonincreasing.  The  individual  LOD 
readings  Y^  are  distributed  as  a  mixture  of  two  normals.  By  (31),  Vjv-i  is  the  average  of  a 
collection  of  iid  random  variables,  each  of  which  is  a  mixture  of  normals.  It  can  be  shown 
that  the  distribution  of  Vjv|Yjv-i  is  a  more  complex  mixture  of  normals  that  does  not  satisfy 
the  Mon(_7')  property  for  our  parameter  values.  Although  neither  condition  in  Proposition  6 
is  satisfied  by  our  data,  the  optimal  cutoff  policy  performed  nearly  as  well  as  the  overall 
optimal  policy  in  the  computational  study  described  in  the  next  section. 

5.3.  The  Dorfman  Policy 

In  the  Dorfman  procedure,  a  pool  of  a  specified  size  is  tested,  after  which  either  every 
sample  in  the  pool  is  deemed  HIV  negative,  or  every  sample  in  the  pool  undergoes  individual 
testing.  Due  to  its  simplicity  and  effectiveness,  this  procedure  is  frequently  used  in  practice 
for  mass  screening  programs.  In  particular,  recent  field  studies  (e.g.,  Behets  et  al.,  Emmanuel 
et  al.  and  Kline  et  al.)  in  developing  countries  demonstrate  that  such  procedures  can  be 
used  to  reduce  the  cost  of  HIV  screening.  The  complexity  of  general  group  testing  strategies, 
such  as  the  one  described  in  Subsection  5.1,  renders  them  more  vulnerable  to  human  error. 
Therefore,  the  improvement  achieved  by  the  more  complex  testing  strategy  could  be  offset 
by  the  human  errors  incurred  during  implementation. 
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Using  the  dynamic  program  of  Subsection  5.1,  we  can  obtain  the  optimal  decision 
rule  for  a  Dorfman  procedure  with  pool  size  n  by  setting  TV  —  1  and  n\  =  ax  =  n,  and 
disallowing  the  option  of  discarding  a  pool  that  contains  more  than  one  sample.  However, 
numerically  solving  the  dynamic  programming  algorithm  requires  a  discretization  of  the  state 
space  of  LOD  readings,  and  can  be  cumbersome  and  computationally  intensive.  Therefore, 
we  propose  a  relatively  simple  method  for  obtaining  a  near  optimal  Dorfman  policy.  Our 
method  relies  on  two  simplifying  assumptions:  (i)  a  cutoff  policy  is  employed  (that  is,  the 
individuals  in  the  pool  are  transfused  if  the  LOD  reading  of  the  pool  is  below  a  certain 
threshold,  and  each  sample  in  the  pool  is  individually  tested  otherwise;  in  the  latter  case, 
a  second  threshold  is  used  to  classify  individuals  as  HIV  positive  or  negative),  and  (ii)  the 
outcome  of  the  pooled  test  is  used  to  calculate  the  posterior  seroprevalence,  but  not  the 
posterior  conditional  densities.  The  first  assumption  is  not  very  restrictive,  particularly 
since  cutoff  policies  are  the  only  policies  that  are  apt  to  be  adopted  in  practice.  However, 
the  second  assumption  is  clearly  not  making  the  most  efficient  use  of  the  pooled  OD  reading. 

Consider  a  Dorfman  policy  of  pool  size  n  applied  in  a  seroprevalence  n  population. 
Let  Yi  be  the  LOD  reading  of  the  ith  individual  and  Yp  be  the  LOD  reading  of  the  poc 
size  n.  Suppose  that  x  is  the  cutoff  employed  for  individual  testing  and  z  is  the  cutoff  for 
group  testing.   If  /_  and  /+  are  the  probability  densities  for  the  LOD  readings  for  P_  and 
P+  respectively,  then  the  cost  of  an  individual  test  is 

/■+00  rx 

d(n,x)  =  c(l)  +  cFP(l  -  tt)  /       f-(y)dy  +  cFNn  /      f+(y)dy.  (39) 

Jx  J —<x> 

Let  Akn  be  the  event  that  A:  out  of  the  n  individuals  are  HIV  positive.  The  cost  incurred  at 
the  group  testing  stage  of  the  process  is 

Cg(z)  =  c(n)  +  cFN  J2  P(Alm)P(Yp  <  z\Akn)k,  (40) 

fc=i 

where  the  first  term  is  the  testing  cost  and  the  second  is  the  misclassification  cost  of  false 
negatives.  Since  Yi, . . . ,  Yn  are  iid,  it  follows  that  P{Yl  e  P+\Akn)  =  £•  Under  our  second 
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assumption,  the  cost  incurred  at  the  second  stage  of  the  testing  procedure  is 

Cig(z,x)  =  £  P(yp  >  z|.4fcn)P(/W)nC;(-,:c);  (41) 

k=o  n 

hence,  the  cost  per  individual  is  C(n,x,z)  =  ^[Cg(z)  +  Cig(z,x)},  or 

C(n,x,z)    =    c(n)  +  cFN  £         "j_       7r*(l  -  ir)n-kkP(Yp  <  z\Akn) 

The  proposed  Dorfman  procedure  is  given  by  the  solution  to  minni  zC(n,x,  z),  which  can 
be  solved  in  two  stages:  Obtain  the  optimal  cutoffs  x  and  z  for  every  n,  and  then  search 
among  the  integers  for  the  optimal  group  size  n. 

Under  our  probabilistic  assumptions,  the  LOD  reading  of  a  pool  composed  of  k  HIV 

positive  and  n  —  k  HIV  negative  individuals  is  normally  distributed  with  mean  fikn  = 
fc„++(n-fc?„-  and  variance  a2n  =  k<rl+(n-k)<rl ,  that  .^  ^^  _  N(^kn,a2kn).  The  opti- 
mal cutoff  points  are  obtained  by  solving  the  first  and  second  order  optimality  conditions. 
Thus,  a  locally  optimal  solution  is  obtained,  which  turned  out  to  be  globally  optimal  in  our 
numerical  studies. 

6.  Computational  Results 

In  this  section,  we  assess  the  relative  performance  of  four  testing  policies:  individual 
testing  (with  optimal  cutoff  values  derived  from  equations  (35)-(36)),  the  heuristic  Dorfman 
policy  developed  in  Subsection  5.3,  the  optimal  Dorfman  policy  derived  from  the  dynamic 
programming  algorithm  and  the  optimal  generalized  group  testing  policy.  A  wide  range  of 
scenarios  are  considered  by  varying  the  seroprevalence  and  false  negative  cost.  The  derivation 
of  the  optimal  policies  is  described  in  Subsection  6.1  and  the  Monte  Carlo  simulation  model 
is  specified  in  Subsection  6.2.  The  policies  are  tested  on  the  simulation  model  in  Subsection 
6.3.  In  Subsection  6.4,  we  apply  our  model  to  the  data  from  N'tita  et  al.  (1991). 
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6.1.  Computer  Implementation 

The  heuristic  Dorfman  policy  in  Subsection  5.2  was  implemented  using  Maple  on  a 
Sun  Sparc  station  20.  The  partial  derivatives  of  C(n,x,z)  with  respect  to  x  and  z  were 
obtained  using  symbolic  differentiation,  and  the  stationary  points  were  identified  using  the 
built-in  routine  solve.  Only  stationary  points  lying  in  the  rectangle  [/x_,/i+]  x  [/i_,/i+]  were 
considered,  and  these  points  satisfied  both  the  first  and  second  order  optimality  conditions 
in  all  cases.  A  search  over  the  integers  was  then  employed  to  obtain  the  optimal  group  size  n. 
The  optimal  group  size  was  found  to  be  bounded  above  by  20  for  n  >  0.001  and  cfn  >  100; 
hence,  the  search  was  restricted  to  this  region  and  a  procedure  similar  to  interval  halving 
was  used. 

The  implementation  of  the  dynamic  programming  algorithm  is  more  complex.  The 
continuous  state  space  must  be  truncated  and  discretized:  our  state  space  consisted  of  200 
equally  spaced  points  in  [/i_  -  6cr_,/i+  -I-  6er+],  with  step  size  0.074.  Simpson's  numerical 
integration  rule  with  fixed  interval  size  was  employed  to  achieve  four  digit  accuracy. 

6.2.  The  Simulation  Model 

The  analytical  model  in  Section  5  assumes  that  LOD  readings  are  normally  distributed 
and  employs  the  simplified  pooling  model  (24);  however,  we  believe  that  this  model  is  not 
sufficiently  realistic  to  provide  a  reliable  assessment  of  the  policies.  Therefore,  we  resort  to 
Monte  Carlo  simulation  to  obtain  a  more  realistic  model.  However,  building  a  simulation 
model  for  this  problem  is  a  nontrivial  task:  There  are  two  possible  sources  of  uncertainty 
in  a  pooled  LOD  reading,  the  variability  in  the  individual  antibody  concentrations  and  the 
randomness  in  the  manner  in  which  antibodies  are  detected  by  ELISAs,  and  it  is  difficult  to 
assess  the  relative  impact  of  each  source.  Moreover,  the  pooling  GLM  (21),  which  predicts 
the  normalized  OD  level  of  a  pool  as  a  function  of  the  antibody  concentrations  of  the  indi- 
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viduals  comprising  the  pool,  cannot  be  directly  simulated  because  the  underlying  antibody 
concentrations  are  unobservable. 

Consequently,  we  test  the  policies  on  two  simulation  models  of  varying  complexity. 
The  simpler  simulation  model  randomly  generates  LOD  readings  for  positive  and  negative 
individuals  from  the  empirical  distributions  of  Dax  that  appear  in  Figure  2(b)  and  uses  the 
deterministic  pooling  model  (22).  Taking  n  =  1  in  equation  (22)  implies  that  an  individual 
sample's  antibody  concentration  pt  is  related  to  its  normalized  OD  level  Xt  by  px  =  rrv:- 
Substituting  ^jj-  for  pi  in  (22)  gives 

and  hence  the  value  of  the  parameter  k  need  not  be  estimated  for  the  simulation  model. 
This  simulation  model  is  more  realistic  than  the  analytical  model  in  two  ways:  the  pooling 
model  (22)  does  not  employ  the  linear  approximation  embedded  in  model  (24),  and  the  LOD 
readings  are  drawn  from  the  empirical  distributions  rather  than  the  normal  distributions. 
Although  the  simulation  model  ignores  the  stochastic  component  of  the  GLM  arising  from 
the  binding  mechanism  of  ELISA,  both  the  variability  in  the  antibody  concentration  and  the 
uncertainty  due  to  the  binding  mechanism  are  embedded  in  the  empirical  LOD  distributions 
of  Dax;  hence,  the  simulation  model  indirectly  captures  the  second  source  of  uncertainty. 

The  more  complex  simulation  model  attributes  the  variability  of  the  LOD  readings 
to  the  variability  of  antibody  concentrations  and  the  stochastic  component  of  the  GLM, 
and  is  derived  from  the  additional  assumptions  that  (a)  the  stochastic  component  of  the 
GLM  is  negligible  for  HIV  positive  individuals  and  (b)  the  antibody  concentration  in  HIV 
negative  individuals  is  deterministic.  The  first  assumption  is  motivated  by  the  observation 
appearing  near  the  end  of  Subsection  4.1  that  the  normalized  OD  reading  for  a  HIV  positive 
individual  with  a  given  antibody  concentration  has  a  very  small  coefficient  of  variation.  To 
justify  the  second  assumption,  we  recall  that  the  normalized  OD  reading  for  a  HIV  negative 


individual  with  a  given  antibody  concentration  has  a  coefficient  of  variation  roughly  equal 
to  one.  The  normalized  OD  readings  for  HIV  negative  individuals  in  Figure  2(a)  have  mean 
0.0083  and  standard  deviation  0.0085,  and  hence  coefficient  of  variation  1.016.  Therefore,  the 
variability  of  the  normalized  OD  readings  of  HIV  negative  individuals  is  mostly  due  to  the 
uncertainty  in  the  binding  mechanism  that  is  captured  in  stochastic  component  of  the  GLM, 
and  consequently  the  variance  of  the  antibody  concentration  of  HIV  negative  individuals  can 
be  approximated  by  zero. 

Let  p_  denote  the  deterministic  antibody  concentration  of  the  HIV  negative  individ- 
uals. To  estimate  p_  from  the  data,  notice  that  equations  (10)  and  (11)  (with  the  logit 
function  replacing  the  cloglog  function  in  (11))  imply  that  the  normalized  OD  readings  for 
HIV  negative  individuals  are  N  \-j^rr,  (fcHfp_)2)-  By  setting  the  mean  and  variance  of  this 
normal  distribution  equal  to  the  respective  mean  and  variance  of  the  empirical  distribution 
in  Figure  2(a),  we  obtain  two  equations  and  two  unknowns.  The  solution  to  these  equa- 
tions is  p_  =  0.968  and  k  =  115.26.  The  large  discrepancy  between  the  latter  value  and 
the  estimated  value  of  about  20  from  Table  1  may  be  due  to  the  fact  that  one  estimate  is 
obtained  from  individual  testing  data  and  the  other  is  obtained  from  a  dilution  study.  To 
generate  the  random  antibody  concentrations  p,  for  HIV  positive  individuals,  we  randomly 
sample  normalized  OD  readings  Xl  from  the  empirical  distribution  in  Figure  2(a),  and  then 
invert  equation  (20)  to  obtain  Pi  =  kXJ(\  —  Xt)  =  115.26Xt/(l  —  Xt).  Then  we  use  equa- 
tions (10)  and  (21)  to  calculate  the  normalized  OD  reading  X  for  a  pool  of  size  n,  where 
the  antibody  concentrations  of  the  n  samples  are  generated  from  the  seroprevalence  and  the 
two  distributions  specified  earlier. 

It  is  not  clear  to  us  which  of  the  two  simulation  models  is  more  realistic;  although 
the  more  complex  model  incorporates  the  stochasticity  in  the  GLM,  its  use  of  the  normal 
distribution  may  lead  one  to  favor  the  simpler  model.    It  is  reassuring  to  report  that  the 
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simulation  results  for  the  two  models  are  qualitatively  nearly  identical  and  quantitatively 
very  similar  (expected  total  costs  are  within  5%  of  each  other).  Hence,  in  the  next  subsection, 
we  only  report  the  results  for  the  simple  simulation  model,  and  then  briefly  comment  on  the 
results  for  the  complex  simulation  model. 

Now  we  describe  the  model  parameters.  The  detailed  cost  estimates  contained  in  the 
field  study  of  Behets  et  al.  are  employed.  They  estimated  the  material  and  labor  cost  of 
testing  a  single  sample  to  be  $2.12,  and  the  cost  of  testing  a  pool  containing  n  >  2  samples 
to  be  $2.87  +  $0.083n.  Without  loss  of  generality,  we  normalize  these  costs  so  that  the  cost  of 
testing  a  single  sample  is  c(l)  =  1,  and  the  cost  of  testing  a  pool  of  size  n  is  c(n)  =  1.35+0.04n 
for  n  >  2.  The  false  positive  cost  cpp  and  the  false  negative  cost  c/r/v  cannot  be  as  easily 
estimated.  To  get  a  rough  estimate  for  cfp,  we  note  that  under  the  current  Red  Cross 
screening  protocol,  individuals  that  are  found  to  be  HIV  positive  during  an  initial  ELISA 
must  undergo  two  additional  ELISAs.  If  at  least  one  of  the  additional  tests  is  positive, 
then  a  highly  specific  test  (Western  Blot)  is  used  to  verify  the  individual's  serological  status. 
Since  ELISA's  specificity  is  more  than  0.99,  the  probability  that  a  noninfected  individual 
with  a  positive  initial  ELISA  test  requires  a  Western  Blot  test  is  approximately  (assuming 
successive  ELISA  results  are  independent)  1  —  0.992.  The  Western  Blot  test  is  approximately 
ten  times  as  costly  in  materials  and  labor  than  an  ELISA  test.  Hence,  the  expected  false 
positive  cost  under  the  Red  Cross  protocol  is  2  +  10(1  —  0.992)  =  2.199.  This  cost  may 
underestimate  the  true  false  positive  cost  because  successive  ELISA  results  are  not  likely  to 
be  independent,  and  the  Western  Blot  test  may  not  be  available  in  developing  countries;  in 
the  latter  case,  a  human  cost  may  be  incurred,  particularly  if  test  results  are  reported  to 
individuals.  Hence,  we  have  chosen  the  conservative  estimate  of  cfp  =  5. 

Since  a  false  negative  cost  will  contaminate  the  blood  supply,  these  costs  are  much 
larger  than  false  positive  costs  and  are  very  difficult  to  quantify.    Therefore,  we  consider 
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four  different  values  for  cfn  (100,  1000,  5000  and  10,000),  and  combined  them  with  seven 
different  values  for  the  seroprevalence  tt  (ranging  from  0.001  to  0.15)  to  generate  28  different 
scenarios  that  span  a  broad  range  of  possible  settings. 

For  each  scenario  of  the  simple  simulation  model,  we  randomly  generated  sample 
LOD  readings  using  the  seroprevalence  ir  and  the  normal  distributions  specified  earlier.  The 
simulation  terminated  at  the  first  time  after  10,000  simulated  pools  when  the  width  of  the 
95%  confidence  interval  for  the  expected  cost  dropped  below  0.2.  To  avoid  the  possibility 
of  sequential  dependencies  due  to  any  inherent  deficiencies  of  the  Turbo  Pascal  random 
generator,  the  ranO  routine  described  in  Chapter  7  of  Press  (1988)  was  used.  All  four 
policies  were  tested  on  the  same  random  sequence  of  LOD  readings. 

6.3.  Simulation  Results 

Dorfman  Policies.  We  begin  by  comparing  the  individual  testing  policy,  heuristic 
Dorfman  policy  and  optimal  Dorfman  policy;  later  in  this  subsection,  the  generalized  group 
testing  policy  will  be  considered.  Before  assessing  the  policies'  performance,  we  note  that  the 
optimal  Dorfman  policy  turned  out  to  be  of  the  form:  Continue  at  the  first  stage  if  the  LOD 
reading  is  either  above  a  cutoff  point  or  below  a  second,  extremely  small,  cutoff  point.  This 
awkward  form  arises  because  under  the  proposed  normal  LOD  distributions  with  a+  >  <r_ , 
the  far  left  tail  of  the  HIV  positive  LOD  reading  eventually  dominates  the  far  left  tail  of  the 
HIV  negative  reading.  However,  this  phenomenon  is  due  solely  to  our  normality  assumption, 
and  does  not  occur  in  Figure  2(b).  Moreover,  such  a  policy  would  never  be  implemented 
in  practice  (with  good  reason),  and  so  we  disallowed  the  option  of  continuing  for  extremely 
low  LOD  readings,  and  only  report  the  performance  of  the  Dorfman  policy  that  was  optimal 
within  the  class  of  cutoff  policies  defined  in  Subsection  5.3.  The  difference  in  performance 
between  the  overall  optimal  Dorfman  policy  and  the  optimal  cutoff  Dorfman  policy  was  very 
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small  in  our  numerical  study,  and  hereafter  we  refer  to  the  optimal  cutoff  Dorfman  policy  as 
the  optimal  Dorfman  policy. 

Our  main  results  are  reported  in  Table  2,  which  describes  the  policies,  and  Table  3, 
which  displays  their  performance  in  the  simulation  study.  The  first  column  in  Table  2 
enumerates  the  28  scenarios,  and  the  next  two  columns  characterize  the  scenarios.  The  final 
column  gives  the  pool  size  for  each  scenario,  which  was  identical  for  the  optimal  Dorfman 
procedure  and  the  heuristic  Dorfman  procedure.  The  remaining  columns  give  the  LOD 
cutoff  points  for  the  individual  testing  policy,  and  for  both  stages  (the  first  stage  is  the 
pooled  testing  stage  and  the  second  stage  is  the  individual  testing  stage)  of  both  Dorfman 
procedures.  For  each  scenario,  Table  3  gives  the  95%  confidence  interval  for  the  expected 
total  cost  of  each  policy. 

The  following  three  observations  can  be  extracted  from  our  numerical  study: 

(1)  The  optimal  and  heuristic  Dorfman  procedures  are  quite  similar.  They  both 
employ  identical  group  sizes  for  each  scenario,  and  their  cutoff  points  for  each  stage  are 
relatively  close  in  value  in  Table  2.  Rather  surprisingly,  as  seen  in  Table  3,  the  heuristic 
procedure  outperforms  the  optimal  procedure  in  23  of  the  28  cases,  and  the  expected  cost 
reduction  for  the  heuristic  procedure  relative  to  the  optimal  procedure  averages  8.1%  over 
the  28  scenarios.  As  seen  in  Table  2,  the  optimal  procedure  is  slightly  more  conservative  in 
the  choice  of  cutoff  in  the  pooled  testing  stage,  resulting  in  policies  that  are  more  sensitive, 
but  require  more  testing.  For  low  seroprevalence,  the  optimal  Dorfman  procedure  seems  to 
overcompensate  for  the  dilution  effect,  so  that  the  improved  sensitivity  does  not  counteract 
the  resulting  increase  in  monetary  testing  cost.  The  strong  performance  of  the  heuristic 
Dorfman  procedure  is  noteworthy,  since  this  policy  is  much  easier  to  derive  than  the  optimal 
Dorfman  procedure. 

(2)  Group  testing  is  optimal  for  all  28  scenarios,  and  significant  savings  over  individual 
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CUTOFF  VALUES 

Heuristic 

Optimal 

7T 

CFN 

Individual 

Stage  1 

Stage  2 

Stage  1 

Stage  2 

Pool  Size 

1 

0.001 

100 

-2.99 

-4.623 

-3.201 

-4.653 

-2.951 

15 

2 

1000 

-3.159 

-4.666 

-3.328 

-4.727 

-3.099 

12 

3 

5000 

-3.283 

-4.728 

-3.391 

-4.727 

-3.247 

12 

4 

10000 

-3.338 

-4.755 

-3.420 

-4.801 

-3.247 

12 

5 

0.005 

100 

-3.107 

-4.652 

-3.228 

-4.653 

-3.025 

13 

6 

1000 

-3.283 

-4.633 

-3.416 

-4.653 

-3.247 

8 

7 

5000 

-3.414 

-4.666 

-3.523 

-4.727 

-3.321 

7 

8 

10000 

-3.472 

-4.652 

-3.586 

-4.653 

-3.395 

6 

9 

0.01 

100 

-3.159 

-4.623 

-3.258 

-4.653 

-3.099 

10 

10 

1000 

-3.339 

-4.625 

-3.448 

-4.653 

-3.247 

7 

11 

5000 

-3.472 

-4.652 

-3.569 

-4.653 

-3.395 

6 

12 

1000 

-3.532 

-4.626 

-3.640 

-4.653 

-3.513 

5 

13 

0.025 

100 

-3.231 

-4.564 

-3.301 

-4.579 

-3.173 

7 

14 

1000 

-3.415 

-4.620 

-3.485 

-4.653 

-3.321 

6 

15 

5000 

-3.553 

-4.638 

-3.623 

-4.653 

-3.513 

5 

16 

10000 

-3.615 

-4.675 

-3.671 

-4.727 

-3.543 

5 

17 

0.05 

100 

-3.287 

-4.545 

-3.324 

-4.505 

-3.247 

5 

18 

1000 

-3.476 

-4.593 

-3.520 

-4.653 

-3.395 

5 

19 

5000 

-3.618 

-4.594 

-3.673 

-4.653 

-3.543 

4 

20 

10000 

-3.681 

-4.634 

-3.727 

-4.653 

-3.617 

4 

21 

0.1 

100 

-3.347 

-4.514 

-3.353 

-4.505 

-3.321 

5 

22 

1000 

-3.54 

-4.545 

-3.561 

-4.579 

-3.469 

4 

23 

5000 

-3.686 

-4.637 

-3.694 

-4.653 

-3.617 

4 

24 

10000 

-3.752 

-4.679 

-3.752 

-4.727 

-3.691 

4 

25 

0.15 

100 

-3.384 

-4.446 

-3.383 

-4.431 

-3.321 

4 

26 

1000 

-3.581 

-4.571 

-3.573 

-4.579 

-3.543 

4 

27 

5000 

-3.73 

-4.665 

-3.711 

-4.653 

-3.691 

4 

28 

10000 

-3.797 

-4.707 

-3.771 

-4.727 

-3.765 

4 

Table  2:  Optimal  Dorfman,  heuristic  Dorfman,  and  optimal  individual  testing  policies  for 
the  28  scenarios. 
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Expected  Total  Cost 

Individual 

Heuristic 

Optimal 

1 

1.0069  ±  0.0029 

0.2094  ±  0.0026 

0.2277  ±  0.0027 

2 

1.0076  ±  0.0012 

0.2843  ±  0.0045 

0.4139  ±  0.0061 

3 

1.0107  ±  0.0014 

0.4221  ±  0.0041 

0.4245  ±  0.0168 

4 

1.0118  ±  0.0015 

0.5136  ±  0.0044 

0.6931  ±  0.0045 

5 

1.0127  ±  0.0049 

0.2998  ±  0.0034 

0.2926  ±  0.0034 

6 

1.0318  ±  0.0278 

0.3580  ±  0.0077 

0.3771  ±  0.0103 

7 

1.1035  ±  0.0980 

0.4681  ±  0.0487 

0.5790  ±  0.0399 

8 

1.1423  ±  0.0980 

0.5314  ±  0.0980 

0.5128  ±  0.0925 

9 

1.0140  ±  0.0046 

0.3479  ±  0.0037 

0.3734  ±  0.0040 

10 

1.0528  ±  0.0392 

0.4149  ±  0.0131 

0.4521  ±  0.0163 

11 

1.0726  ±  0.0980 

0.4735  ±  0.0063 

0.5768  ±  0.0980 

12 

1.0221  ±  0.0021 

0.5382  ±  0.0980 

0.5960  ±  0.0980 

13 

1.0147  ±  0.0042 

0.4490  ±  0.0044 

0.4542  ±  0.0046 

14 

1.0761  ±  0.0480 

0.5325  ±  0.0201 

0.5731  ±  0.0239 

15 

1.3000  ±  0.0980 

0.7512  ±  0.0980 

0.7608  ±  0.0980 

16 

1.4672  ±  0.0980 

0.9689  ±  0.0980 

1.1173  ±  0.0980 

17 

1.0289  ±  0.0084 

0.5964  ±  0.0058 

0.5789  ±  0.0055 

18 

1.1663  ±  0.0759 

0.6946  ±  0.0344 

0.7783  ±  0.0417 

19 

1.5831  ±  0.0980 

0.9148  ±  0.0980 

1.1522  ±  0.0980 

20 

1.9865  ±  0.0980 

1.1504  ±  0.0980 

1.4075  ±  0.0980 

21 

1.0340  ±  0.0095 

0.7617  ±  0.0065 

0.7649  ±  0.0070 

22 

1.3203  ±  0.0980 

0.9980  ±  0.0652 

0.9683  ±  0.0590 

23 

1.8767  ±  0.0980 

1.5287  ±  0.0980 

1.5318  ±  0.0980 

24 

2.3007  ±  0.0980 

1.9354  ±  0.0980 

1.9686  ±  0.0980 

25 

1.0389  ±  0.0101 

0.9031  ±  0.0078 

0.9044  ±  0.0085 

26 

1.3777  ±  0.0980 

1.1965  ±  0.0741 

1.1573  ±  0.0688 

27 

1.9335  ±  0.0980 

1.7117  ±  0.0980 

1.7876  ±  0.0980 

28 

2.7271  ±  0.0980 

2.5229  ±  0.0980 

2.5664  ±  0.0980 

Table  3:  Simulated  performance  of  the  policies  for  the  28  scenarios. 
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testing  are  achieved.  The  expected  cost  reduction  for  the  optimal  Dorfman  procedure  relative 
to  individual  testing  ranges  from  5.9%  to  77.4%,  and  the  average  cost  reduction  over  the  28 
scenarios  in  Table  3  is  39.3%.  For  the  heuristic  Dorfman  policy,  the  expected  cost  reduction 
relative  to  individual  testing  ranges  from  7.5%  to  79.2%  and  averages  43.4%.  The  monetary 
testing  cost  is  also  significantly  reduced;  the  average  reduction  relative  to  individual  testing 
is  40%  for  the  optimal  policy  and  46%  for  the  heuristic  policy.  Moreover,  although  we  do 
not  show  the  numbers  in  Table  3,  both  Dorfman  procedures  are  highly  sensitive  and  specific. 
The  average  sensitivity  and  specificity  over  the  28  scenarios  are  99.7%  and  99.7%  for  the 
individual  testing  policy,  99.8%  and  99.7%  for  the  heuristic  Dorfman  procedure,  and  99.8% 
and  99.7%  for  the  optimal  Dorfman  procedure.  Moreover,  the  sensitivity  of  the  Dorfman 
procedures  never  dropped  below  99%. 

(3)  In  Table  2,  the  optimal  cutoff  values  for  individual  testing  are  very  similar  to 
the  optimal  cutoff  values  for  the  individual  testing  stage  of  the  two  Dorfman  procedures, 
and  range  from  -3.1  to  -3.7;  (recall  that  //_  =  -4.82,  er_  =  0.42,  //+  =  0.80  and  a+  =  1.05). 
However,  the  optimal  cutoff  values  for  the  pooled  testing  stage  of  the  two  Dorfman  procedures 
are  much  lower,  ranging  from  —4.4  to  —4.8.  Hence,  the  Dorfman  procedure  is  able  to 
maintain  its  high  test  accuracy  by  a  judicious  choice  of  cutoffs  at  each  stage;  more  specifically, 
the  cutoff  level  is  more  conservative  at  the  pooled  testing  stage  to  compensate  for  the  dilution 
effect.  In  contrast,  previous  (field  and  statistical)  researchers  in  pooled  testing  have  assumed 
that  the  the  same  cutoff  level  is  used  at  both  stages;  in  particular,  the  cutoff  level  proposed 
by  the  test  kit  manufacturer,  which  presumably  is  close  to  optimal  for  individual  testing,  is 
employed  at  both  stages  of  the  Dorfman  procedure. 

To  assess  the  performance  of  the  traditional  Dorfman  policy  that  has  been  considered 
in  previous  studies,  we  assume  that  the  optimal  cutoff  level  for  individual  testing  (i.e., 
the  fourth  column  of  Table  2)  is  employed  at  both  stages  of  the  procedure.    Under  this 
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assumption,  the  optimal  value  of  the  pool  size  n  was  derived  using  the  cost  function  (42). 
The  optimal  pool  size  was  15  for  scenario  1  and  two  for  the  other  27  scenarios.  The  average 
expected  cost  reduction  relative  to  individual  testing  was  12.78%,  which  is  much  smaller 
than  the  39%  to  43%  reduction  achieved  by  the  proposed  Dorfman  procedures. 

To  illustrate  the  predictive  power  of  our  model  with  respect  to  the  traditional  Dorfman 
policy,  we  consider  the  study  carried  out  by  Behets  et  al.  in  Kishasha,  Zaire,  where  the 
seroprevalence  of  the  8000  samples  was  2.44%.  The  traditional  Dorfman  procedure  with 
pools  of  size  ten  reduced  the  monetary  cost  of  HIV  screening  by  56%  relative  to  individual 
testing;  however,  six  low  reactivity  individuals  were  not  detected.  We  used  the  Monte  Carlo 
simulation  model  to  calculate  the  performance  of  the  traditional  Dorfman  procedure  that 
employed  the  individual  testing  cutoff  of  scenario  15  at  both  stages.  The  expected  sensitivity 
of  the  procedure  were  96.4%  and  the  expected  monetary  testing  cost  was  0.40  (recall  that 
our  testing  cost  function  c(n)  is  based  on  the  cost  model  in  Behets  et  al.);  hence,  our 
analysis  predicts  a  60%  reduction  in  monetary  testing  cost  and  (0.025) (0.036) (8000)=7. 2 
false  negatives.  Therefore,  the  model  accurately  captures  both  the  magnitude  of  the  cost 
savings  and  the  extent  of  the  dilution  effect  as  manifested  by  the  low  reactivity  individuals 
that  are  not  detectable  in  pools.  Under  the  heuristic  Dorfman  policy  for  scenario  13,  the 
expected  monetary  testing  cost  is  0.43  and  the  sensitivity  is  99.81%,  and  hence  the  expected 
number  of  false  negatives  is  (0.025)(0.0019)(8000)=0.38.  Therefore,  we  predict  that  the 
heuristic  Dorfman  procedure  would  not  have  had  any  trouble  detecting  the  low  reactivity 
individuals. 

Additional  scenarios  were  considered  to  generate  Figure  8,  which  provides  switching 
curves  depicting  the  optimal  group  size  (as  calculated  by  the  heuristic  Dorfman  procedure)  as 
a  function  of  both  the  seroprevalence  and  the  false  negative  cost.  As  expected,  the  group  size 
is  a  decreasing  function  of  both  quantities;  if  the  seroprevalence  is  high,  then  large  group  sizes 
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will  contain  HIV  positive  individuals  with  high  probability.  Similarly,  if  the  false  negative 
cost  is  high,  then  smaller  group  sizes  are  required  to  diminish  the  impact  of  dilution.  Notice 
that  groups  of  size  two  or  three  are  never  optimal.  This  phenomenon  is  due  to  the  pooled 
testing  cost  c(n)  =  1.35  +  0.04n:  The  cost  of  constructing  the  pool  is  larger  than  the  limited 
savings  realized  when  the  group  size  is  two  or  three.  Finally,  in  contrast  to  the  case  of  perfect 
binary  tests,  where  group  testing  is  optimal  if  and  only  if  the  proportion  of  defective  items  is 
less  than  (3  —  VE)/2  «  0.382  (see  Ungar  1960),  group  testing  is  only  optimal  in  Figure  8  for 
significantly  lower  seroprevalences.  The  breakeven  (between  individual  and  group  testing) 
seroprevalence  in  Figure  8  is  a  nonincreasing  function  of  Cj?n,  and  for  c^/v  >  100  it  is  less 
than  or  equal  to  0.18.  The  gap  in  breakeven  seroprevalence  between  0.18  and  0.382  is  due  to 
the  form  of  our  pooled  testing  cost  (the  traditional  cost  is  c(n)  =  1  for  all  n)  and  the  presence 
of  statistical  errors,  which  becomes  increasingly  important  as  seroprevalence  increases  and 
leads  to  a  more  conservative  choice  of  group  size. 

The  results  for  the  complex  simulation  model  described  in  Subsection  6.2,  although 
not  shown  here,  are  consistent  with  the  results  from  the  simple  simulation  model.  The 
average  sensitivity  and  specificity  over  the  28  scenarios  are  99.8%  and  99.7%  for  the  heuristic, 
and  99.8%  and  99.8%  for  the  optimal.  Relative  to  the  individual  testing  policy,  the  average 
expected  cost  reduction  over  the  28  scenarios  is  42.4%  for  the  optimal  Dorfman  procedure  and 
46.1%  for  the  heuristic  Dorfman  procedure,  which  are  slightly  larger  than  the  corresponding 
values  in  the  simple  simulation  model.  The  only  qualitatively  different  result  for  the  more 
complex  simulation  model  occurs  in  scenario  4,  where  the  low  seroprevalence  and  high  false 
negative  cost  lead  to  a  very  conservative  cutoff  at  the  pooled  testing  stage  for  the  optimal 
Dorfman  procedure  (see  Table  2).  Consequently,  the  optimal  procedure  performed  too  many 
individual  tests  and  fared  poorly.  When  the  optimal  pooled  cutoff  is  increased  from  -4.801  to 
-4.76,  the  monetary  testing  cost  drops  drastically,  while  the  sensitivity  and  specificity  remain 
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Figure  8:  Optimal  group  size  as  a  function  of  Cfn  and  it. 

unchanged.  Hence,  the  optimal  Dorfman  procedure  is  not  robust  when  the  seroprevalence 
is  very  low  and  the  false  negative  cost  is  very  high,  at  least  under  the  complex  simulation 
model. 

Generalized  Group  Testing  Policies.  We  have  thus  far  focused  on  the  Dorfman 
policy,  due  to  its  simplicity  and  ease  of  use.  However,  it  is  worthwhile  comparing  the 
relative  performance  of  the  optimal  Dorfman  policy  and  a  generalized  group  testing  policy. 
Recall  that  the  dynamic  programming  algorithm  finds  the  optimal  generalized  group  testing 
policy  for  a  given  initial  pool  size  and  subpool  configuration.  To  solve  a  different  dynamic 
program  for  each  possible  initial  pool  size  and  subpool  configuration  is  computationally 
prohibitive.  Instead,  we  investigate  the  performance  of  the  generalized  group  testing  policy 
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for  two  scenarios.  For  scenario  19,  where  the  optimal  Dorfman  pool  size  is  four,  we  let  the 
initial  pool  size  n\  =  4,  and  consider  the  subpool  configuration  in  Figure  7.  For  scenario  6, 
where  the  optimal  Dorfman  pool  size  is  8,  we  let  the  initial  pool  size  ri\  =  8,  and  consider 
the  subpool  configurations  (a)  N  =  2,  a.\  =  2  and  a2  =  4,  and  (b)  N  —  3  and  a{  =  a2  — 
a3  =  2.  For  scenario  19,  the  expected  total  cost  of  the  generalized  multistage  procedure  was 
1.285  ±  0.0098,  which  is  higher  then  the  cost  of  either  of  the  proposed  Dorfman  policies.  For 
scenario  6,  subpool  configuration  (a)  outperforms  configuration  (b),  and  yields  an  expected 
total  cost  of  0.3558  ±  0.014.  This  corresponds  to  a  5.6%  expected  cost  reduction  relative  to 
the  optimal  Dorfman  procedure,  and  a  0.6%  reduction  over  the  heuristic  policy.  In  summary, 
although  it  is  possible  to  obtain  generalized  multistage  policies  that  outperform  the  Dorfman 
procedures,  the  additional  improvement  appears  to  be  offset  by  the  difficulty  in  deriving  and 
implementing  these  policies. 

6.4.  Application 

The  numerical  results  in  Subsection  6.3  (for  example,  the  switching  curves  in  Figure  8) 
cannot  be  universally  applied  for  several  reasons.  Our  numerical  results  depend  upon  the 
HIV  positive  and  HIV  negative  distributions,  which  may  differ  across  the  world,  due  to 
seroconversion  rates  (a  larger  seroconversion  rate  may  lead  to  a  fatter  left  tail  of  the  LOD 
readings  for  HIV  positive  individuals)  or  the  particular  strain  of  virus  that  is  prevalent.  Also, 
the  testing  cost  c(n)  may  depend  upon  various  economic  factors  that  are  distinctive  to  each 
country.  Nevertheless,  we  will  loosely  apply  our  results  in  a  documented  setting  to  obtain  a 
rough  estimate  of  the  benefits  that  are  achievable  from  group  testing. 

In  Kishasha,  Zaire,  of  the  3741  units  of  blood  transfused  in  February  1990,  1045 
(27.9%)  were  not  screened  for  HIV  infection  (see  N'tita  et  al.).  Assuming  that  this  was  a 
consequence  of  budget  constraints,  we  can  propose  an  alternative  strategy  that  will  reallocate 


48 


funds  across  the  transfusion  centers  so  that  every  blood  donor  can  be  tested  for  antibodies 
to  HIV.  Since  72.1%  of  the  units  were  individually  tested,  the  monetary  testing  cost  of  the 
currently  implemented  policy  is  0.721.  Seroprevalence  in  Zaire  is  estimated  to  be  about 
2.5%  (see  Behets  et  al.).  Suppose  that  the  individual  testing  policy  under  scenario  15 
was  employed  on  72.1%  of  the  units.  Since  the  expected  sensitivity  and  specificity  of  this 
policy  are  99.8%  and  99.6%,  respectively,  the  expected  number  of  infected  units  that  are 
transfused  is  (0.025)(1045)  +  (0.025) (0.002) (2696)  =  26.25,  and  the  expected  number  of  false 
positives  is  (0.975) (0.004) (2696)  =  10.24.  If  we  use  the  heuristic  Dorfman  policy  under 
scenario  15,  then  the  monetary  testing  cost  is  0.54,  the  expected  sensitivity  is  99.8%  and 
the  expected  specificity  is  99.6%.  Hence,  the  expected  number  of  infected  units  that  are 
transfused  is  (0.025)(0.002)(3741)  =  0.18,  and  the  expected  number  of  false  positives  is 
(0.975)(0.004)(3741)=14.6.  In  summary,  pooled  testing  in  this  setting  reduces  the  monetary 
testing  cost  by  25%  and  reduces  the  expected  number  of  infected  units  transfused  from  26 
to  essentially  zero.  It  is  clear  that  pooled  testing,  if  used  properly,  can  save  hundreds  of  lives 
worldwide. 

7.  Concluding  Remarks 

We  have  developed  and  validated  a  mathematical  model  that  captures  the  dilution 
effect  that  occurs  when  HIV  positive  sera  are  pooled  with  HIV  negative  sera.  The  general- 
ized linear  models  (13)  and  (21)  develop  new  insights  into  the  nature  of  the  dilution  effect, 
and  avoid  the  heteroscedasticity  problem  that  has  plagued  the  traditional  regression  mod- 
els obtained  through  a  purely  empirical  approach.  These  GLMs  and  the  simplified  pooling 
model  (24)  may  be  useful  for  other  applications  besides  group  testing,  and  our  general  ap- 
proach may  be  applicable  whenever  pooled  testing  is  used  to  identify  a  disease  or  contaminant 
in  a  liquid. 
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Our  numerical  results  suggest  that  the  heuristic  Dorfman  policy  derived  in  Subsection 
5.3  provides  a  cost-effective,  accurate  and  relatively  simple  alternative  to  currently  imple- 
mented HIV  screening  protocols.  This  policy  can  be  used  in  developing  countries  to  safeguard 
the  integrity  of  the  blood  supply,  and  consequently  reduce  the  spread  of  the  AIDS  epidemic. 
While  existing  field  studies  and  mathematical  analyses  assume  that  the  same  cutoff  point  is 
used  to  classify  the  pool  at  both  stages  of  the  Dorfman  procedure,  our  analysis  shows  that 
only  by  selecting  a  different  cutoff  point  at  each  stage  can  we  ensure  that  the  sensitivity  of 
the  test  is  not  compromised.  Finally,  HIV  testing  is  also  extensively  used  for  seroprevalence 
estimation;  in  a  companion  paper,  we  show  how  the  pooling  model  developed  and  validated 
here  can  be  employed  to  derive  efficient  seroprevalence  estimates. 
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Appendix 

Proof  of  Proposition  1.  By  definition,  for  s  =  0, 1,2, 

Vi>a+1    =    2*+1(V£+1-aO  (44) 

=    2s+l(l-Y?3  +  ehS+l-n)  (45) 

=    Vis  +  2s(2ei>3+l-fJ>)  (46) 
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=    Vi3  +  ciiS+1,  (47) 

where  Ci,,+i  is  a  random  variable  with  zero  mean.  ■ 

Proof  of  Proposition  2.  Since  E(Vlj(x)  —  Vl0(x))2  is  positive  semi-  definite  quadratic 
form,  the  minimum  is  derived  from  the  first  order  conditions.  We  have 

E(Vu(x)  -  Ko(x))2    =    E[(Vis  -  V'l0  +  (2°  -  \)(jjl  -  x))2}  (48) 

=    E[(yu  -  Ko)2]  +  2E[(VU  -  ViQ){2*  -  l)fji  -  x))\ 
+    (2s-l)2(fi-x)2,  (49) 

and  therefore 

^E(Vu{x)- Vio(x))2    =    -2(25-l)E[(K,-Ko)]+2(2i-l)2(x-/i)  (50) 

=    2(2'-l)2(x-^),  (51) 

since  E(Vi3)  =  E(Vi0)  by  Proposition  1.  Thus,  the  minimum  is  attained  at  x  =  /z.  ■ 

Proof  of  Proposition  3.    The  result  follows  by  calculating  E(p.)  in  equation  (29) 

and  using  Proposition  1,  i.e.,  E(Vi3  -  Vi0)  =  0.  ■ 

Proof  of  Proposition  4.    We  want  to  obtain  the  set  of  nonnegative  weights  Wi 
minimizing  the  sample  variance  of  the  estimator  p..  Equation  (29)  can  be  reexpressed  as 


sSig.i«.(i-2,)(y<»-  Wo) 

10£LiMl-2*)2] 


M=M ,n^3      r.      ^ ^7T^ •  (°2) 


Since  different  samples  are  independent,  the  optimal  choice  of  weights  is  given  by  the  solution 
to  the  following  minimization  problem: 

minimize    E[£,3s=1w3(l  -  2"){Via  -  Vi0)]2  (53) 

subject  to     YL]=\  ws{\  -  2s)2  =  constant  (54) 

ws  >0. 
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The  minimization  problem  can  be  simplified  by  defining  x3  =  ws(l  —  2s),  so  that  the  objective 
function  (53)  becomes 

3  3  3 

£K*2(Vi.  -  Ko)2]  +  2EQ2    Yl   xkx,{Vu  -  Vi0)(Vik  -  Ko)].  (55) 

3=1  s=\k=s+\ 

If  k  >  s,  then 

E[(Vis-Vl0)(Vlk-Vl0)}    =  E[E[(Vls-Vl0)(Vlk-Vl0)\Vls}}  (56) 

=  E[(Vls-Vl0)E[(Vlk-Vl0)\Vls}  (57) 

=  E[(VU  -  Vio)2]  (58) 

=  Var(K,)-  (59) 

Equation  (56)  follows  from  the  law  of  total  probability  for  conditional  expectations, 
and  equations  (58)  and  (59)  follow  from  the  martingale  property  of  the  driftless  random 
walk  Vij.  Combining  equations  (55)  and  (59)  gives  the  following,  simplified  version  of  the 
minimization  problem: 

minimize    £'=i  x23Va,r(Vis)  +  2  £'=i  (xaVarV^EJL+i  **))  (60) 

subject  to  ELi(2'  -  l)x,  =  c  (61) 

xs  >  0, 

where  c  is  the  normalization  constant. 

The  optimal  solution  is  obtained  by  formulating  the  Lagrangian 

3  3  3  3 

L(x,\)  =  £x2Vai(Vi,)  +  2Y,(xaVai(Via)   £   xfc)+2A(c- £(2' -  l)xa).  (62) 

*=1  3=1  k=S+l  3=1 

The  Karush-Kuhn-Tucker  optimality  conditions  state  that  if  there  exists  A  and  nonnegative 
weights  ws  satisfying  ££-  =  0  and  (61),  then  w  is  the  optimal  solution  of  the  minimization 
problem.  This  derivative  can  be  written  as 

Var(K5)  £  xk  +  £  ^Var(Kifc)  =  A(2*  -  1).  (63) 

k=s  fc=l 
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By  the  orthogonality  property  of  martingales  (Williams  1991,  Chapter  12), 

Var(Vls)  =  Var(t&)  +  Vax(Vu  -  Vik)  (64) 

for  all  s  >  k.  Combining  equation  (63)  and  (64),  the  condition  J^-  =  0  is  reformulated  as 

Vai(Via)  £  xk  +  J2  xkVa.r(Vl3  -  Vik)  =  X{V  -  1).  (65) 

The  Karush-Kuhn-Tucker  conditions  are  more  conveniently  formulated  using  matrix 
notation.  Let  U  be  the  3x3  matrix  defined  by  Usk  =  Var(V„  —  Vr,*:)l(J,>fc),  u  be  the  3x1 
vector  defined  by  us  =  2s  —  1,  v  be  the  3x1  vector  given  by  vs  =  Var(Vis)  and  e  be  the  3x1 
unit  vector.  The  optimality  conditions  can  then  be  expressed  as 

(xTe)v  +  Ux    =    Xu  (66) 

uTx    =    c  (67) 

x    >    0.  (68) 

In  order  to  obtain  the  optimal  vector  x,  the  vector  quantities  U  and  v  should  be  determined. 
For  the  random  walk  model  described  in  Subsection  4.3, 

v3  =  a2(F-l).  (69) 

From  the  first  row  of  equation  (66),  we  obtain 

axTe  =  X,  (70) 

since  the  first  row  of  U  is  the  zero  vector  and  Ui  =  1.  Therefore,  by  combining  equations  (69) 
and  (70),  equation  (66)  becomes  Ux  =  0.  Hence,  Xi  =  X2  =  0,X3  =  f  and  A  =  cr2X3  satisfy 
the  optimality  conditions.  This  completes  the  proof  of  Proposition  of  4.  The  most  efficient 
estimator  is 

-  _     Et  =  l(8^"iN  -  Vjp)  /71\ 

p-  70         •  i'i; 
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Proof  of  Lemma  1:  The  proof  is  by  induction.  The  statement  is  true  when  j  = 
N;  assume  that  it  is  true  for  j  =  k.  Using  the  law  of  total  probability  for  conditional 
expectations  and  the  fact  that  all  individuals  in  the  pool  are  indistinguishable,  we  can 
write  £[/ry-  €P  JSfc_i]  =  E  E[LY  €P  \  \Sk]\Sk-\  ,  and  by  the  induction  hypothesis,  the  right 
side  is  equal  to  E\E[I,Yk&pAYk]\Sk-\  •  By  Corollary  2,  this  expression  is  equivalent  to 
E  \E[Ly  €P  i|Vfc]|Yjfc_i  ,  which  depends  only  on  the  latest  LOD  observation  Yfc-i-  B 

Proof  of  Proposition  6:  The  proof  is  by  induction.  Recall  that  from  equation  (34) 
and  Proposition  5, 

Jj(Xj)  =  min{aJ+1(c(nJ+2)  +  E[J]+l(YJ+l)\Yj}), 

nJ+lcFNP(Y,  e  P+IYfrnj+iCppPiYj  €  P-lYj)}    for  j  =  0,...  ,N-  1.         (72) 

Let  Jj{Yj)  =  JjiYj-nj+iCFNPiYj  g  P+|^-).  and  F(YN)  =  cFPP(YN  g  P_\YN)-cFNP(YN  g 
P+\YN).  By  the  law  of  total  probability  and  the  fact  that  all  individuals  in  the  pool  are 
indistinguishable, 

H  aj+lE[Jj+l(Yj+l)\Yj\   -nj+lCFNP(YjeP+\Yj) 

=    aJ+\E  Jj+i(Yj+i)  —  tij+2CfnP{Yj+\  g  P+\Y]+\)\Y3 

=    aJ+1E[JJ+1(yj+1)l^l,  (73) 


and 


nj+l  (cFPP(Yj  e  PAY,)  -  CFNPiYj  e  P+\Yj)) 
=    n]+lE  [cFPP(YN  g  P.\YN)  -  cFnP(Yn  e  P+\YN)\Y,_ 
=    nj+1E[F(YN)\Yj}.  (74) 

Subtracting  Uj+iCfnP(Y]  g  P+\Yj)  from  both  sides  of  (71)  yields 

Jj(Yj)  =  irw{aj+Mnj+2)  +  E[jj+1(Yj+l)\Y^  (75) 
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Straightforward  algebraic  manipulations  show  that 

F(YN)  = .  J\(Yn)  ■  (76) 

hence,  F(Y)v)  is  monotone  nonincreasing  by  the  assumed  monotonicity  of  y-.  Since  J/v(V/v)  = 
min{0,  F(V,v)},  the  function  Jn(Yn)  is  monotone  nonincreasing  and  the  unique  root  of 
F(V^v)  gives  the  optimal  cutoff  c^  for  stage  N. 

To  prove  inductively  that  RJ  =  {Yj  :  Yj  <  cj}  for  some  cj ,  let  us  assume  that 
Jj+i(Yj+\)  is  monotone  nonincreasing.  Then  by  the  Mon(j')  property,  the  nonzero  terms 
on  the  right  side  of  (74)  are  also  monotone  nonincreasing.  Therefore,  there  exists  cj  (the 
minimum  of  the  roots  of  the  two  terms)  such  that  J3{Yj)  =  0  if  and  only  if  Y3  <  c~;  thus, 
RJ  =  {Yj  :  Yj  <  c~).  Moreover,  Jj(Yj)  is  monotone  nonincreasing.  This  completes  the  first 
part  of  the  proof.  Similar  arguments  establish  that  Rf  =  {Y}  :  Yj  >  c+}.  ■ 

Proof  of  Proposition  7.  It  is  known  that  E{X)  =  J^P{X  >  x)dx  and  E(X\Z)  = 
/R  P(X  >  x\Z)dx.  For  a  nondecreasing  function  h  and  Z\  >  Z2, 

E[h{Xx)\Z  =  Zl]  -  E[h{Xi)\Z  =  z2]  = 
=     [  P(h(Xi)  >  x\Z  =  zx)dx  -  f  P(h(Xi)  >  x\Z  =  z2)dx  (77) 

=     I {P{Xi  <h-\x)\Z  =  zl)-P(Xl  <h-\x)\Z  =  z2)}dx  (78) 

<    0. 


References: 

Arnold,  S.F.  1977.  Generalized  Group  Testing.  Annals  of  Statistics  5,  1170-1182. 

Behets,  F.,  S.  Bertozzi,  M.  Kasali,  M.  Kashamuka,  L.  Atikala,  C.  Brown,  R.  W.  Ryder  and 
C.  Quinn.  1990.  Successful  use  of  Pooled  Sera  to  Determine  HIV-1  Seroprevalence  in  Zaire 

55 


with  Development  of  Cost-Efficiency  Models.  AIDS  4,  737-741. 

Burns,  K.  C.  and  C.  A.  Mauro.  1987.  Group  Testing  with  Test  Error  as  a  Function  of 
Concentration.  Commun.  Statist. -Theory  Meth.  16,  2821-2837. 

Cahoon-Young,  B.,  A.  Chandler,  T.  Livermore,  J.  Gaudino  and  Ft.  Benjamin.  1989.  Sensi- 
tivity and  Specificity  of  Pooled  Versus  Individual  Sera  in  Human  Immunodeficiency  Virus 
Antibody  Prevalence  Study.  J.  Clinical  Microbiology  27,  1893-1895. 

Cahoon-Young,  B.  A.  Chandler,  T.  Livermore,  J.  Gaudino  and  R.  Benjamin  1992.  Optimal 
Pool  Size  for  Determination  of  HIV  Prevalence  in  Low  Risk  Populations.  Presented  at  the 
HIV/ AIDS  Surveillance  Workshop.  South  San  Fransisco,  CA. 

Cox,  D.  R.  and  D.  V.  Hinkley.  1974.  Theoretical  Statistics.  Chapman  and  Hall,  London. 

Dax,  E.  M.  Director,  National  HIV  Reference  Laboratory,  Melbourne,  Australia.  1993.  Pri- 
vate Correspondence. 

de  Gourville,  E.  Research  Associate,  CAREC,  Trinidad  W.I.  1992.  Private  Correspondence. 

Dorfman,  R.  1943.  The  Detection  of  Defective  Members  of  Large  Populations.  Ann.  Math. 
Stat.  44,  436-441. 

Emmanuel,  J.  C,  M.  T.  Bassett,  H.  J.  Smith  and  J.  A.  Jacobs.  1988.  Pooling  of  Sera  for 
Human  Immunodeficiency  Virus  (HIV)  Testing:  An  Economical  Method  for  use  in  Develop- 
ing Countries.  J.  Clinical  Pathology  41,  582-585. 

Fisher,  R.  A.  1922.  On  the  Mathematical  Foundations  of  Theoretical  Statistics.  Phil.  Trans. 
R.  Soc.  222,  309-368. 


56 


George,  R.  J.  Chief.  Dev.  Technology  Section.  Division  HIV/AIDS,  Center  for  Disease  Con- 
trol, Atlanta,  GA.  1992.  Private  Correspondence. 

George,  Ft.  J.  and  G.  Schochetman.  1985.  Serological  Tests  for  the  Detection  of  HIV  In- 
fection. In  AIDS  Testing,  Methodology  and  Management  Issues,  G.  Schochetman  and  J.  R. 
George,  (ed.),  Springer- Verlag,  New  York,  49-69. 

Hastie,  T.J.  and  D.  Pregibon.  1992.  Generalized  Linear  Models.  In  Statistical  Models  in  S , 
J.M.  Chambers  and  T.J.  Hastie,  (ed.).  Wadsworth  &  Brooks/Cole  Computer  Science  Series, 
California,  195-247. 

Hull,  B.  1991.  Serum  Pooling  for  HIV  Screening  in  Trinidad  and  Tobago.  Carribean  Epidi- 
mology  Center  Technical  Report. 

Hwang,  F.  K.  1976.  Group  Testing  with  a  Dilution  Effect.  Biometnka  63,  671-673. 

Hwang,  F.  K.  1984.  Robust  Group  Testing.  J.  Quality  Technology  16,  189-195. 

Johnson,  N.  L.,  S.  Kotz  and  X.  Wu.  1991  Inspection  Errors  for  Attributes  in  Quality  Control. 
Chapman  and  Hall,  London. 

Kline,  R.  L.,  T.  A.  Brothers,  R.  Brookmeyer,  S.  Zegger  and  T.C.  Quinn.  1989.  Evaluation 
of  Human  Immunodeficiency  Virus  Seroprevalence  in  Population  Surveys  using  Pooled  Sera. 
J.  Clinical  Microbiology  27,  1449-1452. 

Ledro-Monroy,  G.  and  E.  Archbold.  1990.  HIV  Serum  Pooling  Study.  Cruz  Roja  Ecuatori- 
ana. 

Litvak,  E.,  X.  M.  Tu  and  M.  Pagano.   1992.  Screening  for  the  Presence  of  HIV  by  Pooling 


57 


Sera  Samples:  Simplified  Procedures.  Working  Paper,  Harvard  School  of  Public  Health. 

Madansky,  A.  1988.  Prescriptions  for  Working  Statisticians.  Springer- Verlag,  New  York. 

McCullagh,  P.  and  J.  A.  Nelder.   1989  Generalized  Linear  Models,  2nd  Edition.   Chapman 
and  Hall,  London. 

N'tita,  I.,  K.  Mulunga,  C.  Dulat,  D.  Lusamba,  T.  Rehle,  R.  Korte  and  H.  Jagger.    1991. 
Risk  of  Transfusion-Associated  HIV  Transmission  in  Kishasha,  Zaire.  AIDS  5,  437-439. 

Press,  W.  H.,  B.  P.  Flanney,  S.  A.  Teukolsky  and  W.  T.  Vetterling.  1988.  Numerical  Recipes 
in  C,  The  Art  of  Scientific  Computing.  Cambridge  University  Press,  Cambridge. 

Tamashiro,  H.,  W.  Maskill,  J.  Emmanuel,  A.  Fauquex,  P.  Sato  and  D.  Heymann.    1993. 
Reducing  the  cost  of  HIV  antibody  testing.  Lancet  342,  87-90. 

Thompson,  K.H.  1962.   Estimation  of  the  proportion  of  vectors  in  a  natural  population  of 
insects.  Biometrics  18,  568-578. 

Tijssen,  P.  1985.   Laboratory  Techniques  in  Biochemistry  and  Molecular  Biology.   Vol.   15. 
Elsevier,  Amsterdam. 

Unger,  P.  1960.   The  cutoff  point  for  group  testing.    Communication  on  Pure  and  Applied 
Mathematics  13,  49-54. 

Williams,  D.  1991.   Probability  with  Martingales.  Cambridge  University  Press,  Cambridge, 
England. 


58 


MIT    LIBRARIES    OUPL 


3  TDfiO  OOflSbbEl  5 


21:25    036 


Date  Due 


■-■ 
AUG  i  o  iggg 


Lib-26-67 


